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PREFACE 


Conditions being what they are at the present day, the dissemina¬ 
tion of knowledge has to be carried on to a large extent by means 
of books. This applies even in such a subject as phonetics, which 
is a science involving a particular skill that has to be imparted 
mainly by oral teaching. Books on scientific subjects are unsatis¬ 
factory instruments; they lag behind the advance of knowledge, 
but they are nevertheless liable to be taken by readers as embodying 
orthodoxies of one kind or another. And orthodoxies are by their 
nature impediments to progress in science. I would therefore 
express the hope that no reader of the present book will regard 
it in any sense as a final or unalterable exposition of the phonemic 
aspects of phonetics, that they will not consider the opinions 
expressed in it to be dogmatic pronouncements, but that they 
will rather look upon the work as an unfinished collection of materials 
which may encourage them to contribute to progress by pursuing 
useful lines of investigation for themselves. 

It is my hope too that readers will also take due account of the 
limited value of writing as a means of suggesting ideas. I share 
L. R. Palmer’s opinion that “speech is nothing more than a series 
of rough hints which the hearer must interpret in order to arrive at 
the meaning which the speaker wishes to convey .’’ 1 Words, clever 
inventions though they are, form in my view a quite inadequate 
medium for symbolizing any but the most elementary conceptions. 
Most words have many meanings, shades of meaning and con¬ 
notations; they therefore often have to be used loosely in order to 
avoid cumbrous sentences which even if set out in full would still 
be ambiguous. It is accordingly with intention that I have at 
times for convenience used loose modes of expression (such as "the 
sound X constitutes a single phoneme,” "treating the sound X 
as a phoneme separate from tne sound Y,” "in this language the 
vowels are subject to wide variation”) in the hope that readers 
will form a correct idea of what is meant. 

I would ask readers further to bear in mind that certain funda¬ 
mental concepts in phonetics, as in other sciences, are incapable of 
definition . 2 It would appear that all sciences lack firm foundations, 

1 L. R. Palmer, Introduction to Modern Linguistics, p. 82. 

* See my article Some Thoughts on the Phoneme in the Transactions of the 
Philological Society, 1944 
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being established on unproved assumptions that certain basic units 
exist and that people know what they are without explanation. 
We assume that everyone perceives the fundamentals in the same 
way, and we are apt to forget that what appears to one person 
an obvious truth does not necessarily appear so to others. These 
are additional reasons why statements made in this book, as in 
other books of a scientific nature, must be received with reserve 
and treated as provisional only. 

The idea of the phoneme is no new one. It was first introduced 
to me in 1911 by the late Professor L. SCerba of Leningrad, but 
both the theory and the word itself date back to more than thirty 
years before then. According to J. R. Firth,* the term "phoneme” 
was invented as distinct from “phone” in 1879 by a linguistic 
scholar named Kruszewski, a pupil of the Polish linguistician 
Baudouin de Courtenay. 4 The phoneme idea has also been (to 
quote Firth) “implicit in the work of all phoneticians and ortho- 
graphists who have employed broad transcription.” In particular, 
it is implicit in the distinction drawn by Henry Sweet between 
"broad" and "narrow" transcription in his Handbook of Phonetics 
(Oxford, 1877), pp. 103-105, and in the principles of transcription 
formulated by Paul Passy for the International Phonetic Associa¬ 
tion in 1888.® 

About 1937, after having made much use of the phoneme theory in 
practical language teaching for some thirty years, I felt that the 
time had come when a book should be written setting out in some 
detail the theory of the phoneme, and illustrating it from as many 
languages as I knew anything about. The work occupied me for 
much longer than I had anticipated, and it was not until January, 
1946, that I was able to send my manuscript to the publisher. Even 
then there were many unexpected delays, and the first edition of 
the book did not actually appear until 1950. 

While I was still engaged in the preparation of the book, a note¬ 
worthy and suggestive work entitled Phonemics appeared in America 


5 The Word "Phoneme" by J. R. Firth in Le Mattre Phonitique, April, 
1934, p. 44. See also footnote 7 to §662 of this book. 

4 The French word "phonfeme” appears to have been invented by L. Havet, 
who employed it in 1876 to mean "speech-sound.” Many other French 
writers on linguistic subjects have used it in the same sense. 

5 The Phonetic Teacher, August, 1888, p. 57. 
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in 1947, by K. L. Pike of the University of Michigan. It was too 
late for me to incorporate any of Pike’s findings in my book. 
Moreover, the theory as presented by me was (and still is) the original 
theory conceived in the 1870’s by Baudouin de Courtenay, though 
expounded mainly on "physical” (as distinguished from "psycho¬ 
logical”) lines, and I see no reason to deviate from it. 

The greater part of the present (third) edition is therefore in the 
main a reprint of the first edition; but to make matters clearer I 
have added an Appendix on “The History and Meaning of the 
Term ‘Phoneme’,” 4 in the course of which I have emphasized the 
fact that my conclusions concerning the phoneme, like those of my 
predecessor Baudouin de Courtenay, have been arrived at entirely 
through phonetics—in fact, that phonemics is an essential part of 
phonetic science. It is shown too how the introduction of a single 
new term "signeme,” to denote "any speech feature whatever 
(segmental or otherwise) which can be made use of for distinguishing 
meanings,” will readily put an end to present-day misunderstandings 
between American and British teachers. 

Finally, I would remark that the phoneme theory has a certain 
bearing upon philosophy, and in particular upon questions relating 
to the “existence” or “non-existence” both of material phenomena 
and of ideas. For if it can be shown that phonemes are abstract 
things which when brought into concrete manifestation appear in 
different forms according to physical environment (a possibility 
suggested in Chapter XXIX), then they would appear to provide 
particularly striking examples of such types of manifestation. So 
since research into the nature of things in general demands enquiry 
into phenomena of this sort, I am in hopes that some of the facts 
here recorded may come to the notice of those who specialize in 
metaphysical problems. 

D. J. 

University College, London. 


• A slightly amended version of a paper first published as a supplement to 
Le Maitre Phonitique in 1957, reproduced here by kind permission of the 
Editor, A. C. Gimson. 
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LIST OF PHONETIC SYMBOLS 

The following is a list of the phonetic symbols used in this book, 
with short particulars of their meanings. It is to be understood 
that most symbols have at times values deviating from what may 
be considered to be their most usual values. 

(1) Consonant Letters of the Roman Alphabet. 

p, b, t, d, k, g (hard), m, n, I, f, h have their customary values. 
The others are as follows: 

c breathed palatal plosive; also used when convenient to 
denote the affricate tj (Eng. ch). 

j voiced palatal fricative; also the corresponding semi-vowel 
(Eng. y in yet). 

q breathed uvular plosive. 

r stands for the various r-sounds of different languages (Eng., 
Fr., Ger., etc.); replaced by other symbols, j, f, r, when 
necessary. 

s as in Eng. see. 

v as in Eng. ever. 

w as in Eng. well. 

z as in Eng. lazy. 

(2) Other Consonant Letters. 
t, retroflex plosives. 

$, 5 , etc., palatalized t, s, etc.; represented in digraphic transcrip¬ 
tion by tj, sj, etc. 

t velarized t; also used for pharyngalized t. 
i voiced palatal plosive; also used when convenient to denote 
the affricate d 3 (Eng. j). 

? glottal stop. 

p Cairene Arabic glottal stop, when corresponding to classical q. 
nj labio-dental nasal. 
r\ retroflex nasal, 
ji palatal nasal (Fr. gn). 
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LIST OF PHONETIC SYMBOLS xv 

q velar nasal (Eng. ng). 

q Japanese syllabic nasal, 

i breathed I (Welsh fl). 

1 velarized (dark) I. 

J a sound intermediate between d and-1. 

X palatal lateral (Ital. gl). 

f single flap tongue-tip r. 

[ retroflex flap. 

r rolled or flapped uvular r (one variety of Parisian r). 

(|>, p bi-labial fricatives. 

0, 8 dental fricatives (Eng. <A-sounds in thing, then). 

j fricative tongue-tip r; also the corresponding frictionless 
continuant, and a retroflexed variety of this; also 
r-coloured a. 

J, 3 palato-alveolar fricatives (Eng. sh, Fr. j). 
s velarized s; also used for pharyngalized s. 

5 breathed palatal fricative (one variety of the Ger. icA-sound). 
e, f alveolo-palatal fricatives (Polish £, £). 
y voiced velar fricative, 

h breathed pharyngal fricative. 

<5 voiced creaky sound made with contracted larynx and 
pharynx (Arabic ‘ain). 
fi voiced h. 

i| consonantal y (= y). 

% dental click. 

!> lateral click. 

tj, d 3 , ts, etc. affricates (one mode of representation), 
ph, th, etc. aspirated plosives. 

(3) Vowel Letters. 

i as in Fr. si, and sounds near to this; used also when con¬ 
venient for the Eng. short i as in sit. 
e as Fr. e, and shades of sound near to this; also used in place 
of £ when possible, e.g. in transcribing the Eng. vowel in set. 
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£ as Fr. i, and sounds near to this; sometimes used for the 
Eng. vowel in set. 

a as in Parisian Fr. la, and sounds near to this; also used in 
the Simplified Transcription of the Eng. short vowel in 
hat and the long vowel in half. 

a as in Parisian Fr. las, and sounds near to this; used for the 
Eng. long vowel in half in narrowed transcription. 

d as o in Fr. forte, and sounds near to this; used in narrow 
transcriptions of Eng. saw (long), hot (short), Ger. Sonne, 
etc. 

o as in Fr. beau, and sounds near to this; replaces a in broad 
transcriptions of Eng. saw, hot, Ger. Sonne, etc. 
u as in Ger. Schuh, and sounds near to this, e.g. the vowels 
in Fr. coup, Eng. too ; used also when convenient for the 
Eng. short u in put, book, 
y close lip-rounded i, as Fr. u, Ger. u. 

0 close lip-rounded e, and sounds near to it, e.g. the Fr. vowel 
in peu. 

oe open lip-rounded e, and sounds near to it, e.g. the Fr. vowel 
in ceuf. 

d open lip-rounded a; used for Eng. vowel in hot in narrow 
transcription. 

a unrounded a; also used for Eng. vowel in cup. 
v unrounded close o. 
uj unrounded u. 

t vowel intermediate between i and ui. 

« vowel intermediate between y and u (= a lip-rounded ♦). 

i a lowered and retracted variety of i; used in Tswana, and 

in transcribing Scottish and American pronunciation of 
sit, etc.; also in narrow transcription of Southern British 
pronunciation of such words, 
t a lowered variety of i. 

o a lowered variety of u, or a very close variety of o; used in 
Tswana, and for the Igbo 0 , also used in transcribing 
American pronunciation of book, etc., and in narrow trans¬ 
cription of Southern British pronunciation of such words. 



LIST OF PHONETIC SYMBOLS xvii 

y a lowered and retracted variety of y; used for Ger. short y 
in narrow transcription. 

a unrounded central vowel (schwa), as Eng. a in along. 

e a lower variety of central vowel, as in Eng. sofa or in Lisbon 
Portuguese para ; also sometimes used to denote the 
quality of long a: in narrow transcriptions of Southern 
English. 

ae a raised a or a very open e; used for Eng. short a in narrow 
transcription. 

t^, 3 , r-coloured a, o. 

r-coloured a; also represented by j or ac 

An index letter means that the sound is that of the main 
letter modified in the direction of that indicated by the 
index. Thus hs means a 5 -like variety of h, and 3 * 
denotes a sound intermediate between 3 and 2 . 

(4) Diacritic Marks. 

" nasalization; l = nasalized e. 

, devoicing; g, j, j = unvoiced n, I, 3 . 
v voicing; s = z, t = American “voiced t.” 

. close variety; e = a very close e, a = ae. 

c open variety; q — a sound between French e and e. 

=> lips more rounded, 

c lips more spread. 

+ advanced variety; u+ or u = a sound between u and u. 

- or - retracted variety; a- or a = a sound between a and a. 
x reused variety; a*, or a = a = ae. 
t lowered variety; eT or e = 5 . 

central vowel; ii = «, e = a high variety of 9 . 

' slight aspiration after p, t, etc. 

’ glottal stop accompanying p, t, etc.; glottal contraction 
accompanying continuant sounds, Danish sted. 
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indication that a vowel is consonantal: jr = q; also used to 
mean that a continuant sound is very short, as ih in 
Sinhalese mb, 8 in Tswana 86. 

under a letter (or over it if the letter has a tail below) means 
that the sound is syllabic; n = syllabic n. 

simultaneous pronunciation of two sounds, e.g. Provensal 
mrj. 

length mark. 

half length. 

at the beginning of a syllable denotes strong stress. 

at the beginning of a syllable denotes extra strong stress. 

at the beginning of a syllable denotes medium (secondary) 
stress. 

(thus a or "a) high level tone. 

in Burmese, a level tone ending with a slight fall and pro¬ 
nounced with creaky voice; in Tswana a high level tone 
requiring that the next succeeding high tone shall be 
slightly lower. 

(thus a or _a) low level tone; in Vietnamese, low tone com¬ 
bined with a creaky voice. 

(thus a or 'a) high rising tone, or rising tone without implica¬ 
tion of height. 

(thus a or ,a) low rising tone. 

(thus i or 'a) high falling tone. 

(thus $ or,a) low falling tone; in Vietnamese, low tone 
combined with breathy voice. 

(thus a or *a) rising-falling tone. 

Panjabi low rising tone. 

(preceding the syllable) Vietnamese rising tone combined 
with creaky voice. 

(preceding the syllable) Vietnamese rising tone combined 
with breathy voice. 
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SPEECH-SOUNDS 

1. Speech is the uttering by the larynx, mouth, nose, etc., of 
various noises which people have agreed to regard as conventional 
symbols of certain meanings. 

2. Nearly every utterance, or "speech chain,” is made up of a 
large number of small elements no two of which are alike. That 
this is so can be seen by the appearance of sound-tracks or oscillo¬ 
grams or enlargements of the grooves of gramophone records. It 
is comparatively rarely that we find sections of a speech chain 
in which the sound has unvarying quality, intensity and 
pitch continued for an appreciable time, i.e. the type of sound 
indicated on oscillograms by a succession of waves of similar 
shape. 

3. For linguistic purposes it is convenient to consider speech 
chains as divisible into sections which we call “speech-sounds” or 
“phones.” The points of separation between successive speech- 
sounds are sometimes shown by abrupt changes in the patterns of 
sound-tracks, etc. Examples of abrupt changes are shown in the 
Frontispiece. More often, however, the points of separation are 
not clearly apparent on such tracks. It has in fact been shown that 
from the viewpoint of physics the "speech-sound” is non-existent, 
and that when one of the entities which we call speech-sounds is 
followed by another, the two more often than not merge into one 
another in a gradual manner. It has been shown that the 
characteristics of a “speech-sound” generally begin to appear before 
the preceding “speech-sound” has terminated and continue to 
show themselves after the succeeding "speech-sound” has begun. 1 
The mechanism is such that what might be termed the character¬ 
istic part of the speech-sound is, when regarded from the 
standpoint of physics, generally not maintained for any appreciable 
time. 


1 See, for instance, E. W. Scripture’s remarks on "overlapping of speech- 
sounds” in his paper on The Nature of Speech in the Proceedings of the Second 
International Congress of Phonetic Sciences (1936), pp. 217-219. 
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4. Nevertheless, the conception of the chain of speech-sounds 
is indispensable in all linguistic investigations. It is, moreover, 
justified by the means by which speech is produced. Speech is 
the result of certain actions performed by the articulatory 
mechanism. The lips, tongue, etc., take up various positions or 
make various movements successively, and these positions and 
movements can be described and classified. The conception 
would appear to be also justifiable on psychological grounds: when 
we speak, we think we utter successions of sounds most of which 
are held on for an appreciable time; and when we listen to speech, 
we think we hear similar successions of sounds. The effect is so 
definite to us that we have as a rule no particular difficulty in saying 
what the sounds in words are, or in assigning letters to them in 
alphabetic writing. 

5. The fact that the change from one speech-sound to the next 
in a sequence is generally performed in a gradual manner is im¬ 
material from the linguistic point of view. Sweet and the other 
pioneers of modern phonetics recognized that passing from one 
"sound” to another was a gradual process. In their terminology 
it is said that consecutive speech-sounds are linked together by 
transitory sounds called "glides.” 2 A “glide” is the sound produced 
by the movement of passing in a natural manner from the position 
(or terminal position) of one speech-sound to the position (or initial 
position) of the next: it is an unavoidable sound,-and has no linguistic 
significance. A glide may involve movements of several parts of 
the speech mechanism; these need not, and in fact generally do 
not, take place exactly simultaneously. 

6 . The point of separation between one speech-sound and the 
next in a chain may be taken to be any point in the glide which 
it is convenient to choose on linguistic grounds. In most cases 
this point does not correspond to any abrupt change in the pattern 
of a sound-track or oscillogram. 

7. The linguistic conception of the “speech-sound” is determined 
by the possibility of removing a section from a chain and replacing 
it by a section of another chain, 3 the sections being such that the 

2 Sweet, Handbook of Phonetics (1877), §189 and elsewhere. A. J. Ellis 
called them “transitional configurations” in his Essentials of Phonetics (1848). 

3 The glides being at the same time replaced by other appropriate glides. 
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exchange is capable of changing a word into another word. I do 
not think it is possible to give an unassailable definition of a "speech- 
sound,” any more than it is possible to give a precise definition of 
a "word,” 4 but a “speech-sound” may be stated roughly to be a 
minimum section capable of being exchanged in the manner above 
indicated. 

8 . Thus we say that the full pronunciation of the word az (as) 
consists of two speech-sounds. The section we represent in writing 
by a is the minimum initial section that can be removed and 
replaced by a section of some other chain—say by i (making iz). 
Similarly, the section we represent by z is the minimum final 
section that can be removed and replaced by a section of some 
other chain—say, by J (sh) (making aj) or by m (making the full 
pronunciation of am). In the last word the separation is less 
precise because the nasality of the consonant is anticipated in the 
vowel. 


"Simple,” "Gliding” and "Compound” Sounds 

9. Speech-sounds which require for their production a static 
position of the speech mechanism (apart from the mechanism 
which propels the air and the mechanism which produces "voice”) 
may be called “simple sounds.” Speech-sounds formed by gradual 
movements of the lips, tongue, etc., may be called "gliding sounds." 
Speech-sounds which in the course of their production have an 
abrupt change of position of the articulating organs may be called 
“compound” sounds. 

10. Examples of simple sounds are ordinary m, n, I, f, z and 
the "pure” vowels. Trills (e.g. rolled r) may also be classed among 
the simple sounds. 

11. Simple sounds are, as a rule, represented in phonetic writing 
by single letters. It is occasionally justifiable to represent them 
by digraphs (sequences of two letters) in order to effect an economy 
of symbols. 


4 The nearest definition that can be suggested for a "word” is perhaps 
that given in L. R. Palmer’s Introduction to Modern Linguistics, p. 79: "the 
smallest speech unit (= constantly recurring sound-pattern) capable of 
functioning as a complete utterance.” But even this appears to need some 
amplification (see Jespersen, Philosophy of Grammar, pp. 94, 95). 
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12. Examples of gliding sounds are English w and j and the 
diphthongs. Examples of compound sounds are the plosive 
consonants; for other examples, see §258 if. Gliding sounds and 
compound sounds are sometimes represented phonetically by single 
letters and sometimes by digraphs; for this, see §§14-19. 

13. It is to be noted that in compound sounds the amount to 
be designated a “speech-sound” depends upon the phonetic context. 
Thus the speech-sound which we represent by the letter k in 
'djoki ( jockey) consists of a "stop” plus a “plosion”; nothing less 
can be removed and replaced by some other section to form a 
different word (say by I making 'd 3 oli, jolly). A k with plosion 
is thus a single speech-sound in this context. On the other hand 
the speech-sound which we represent by k in akt (act) consists of 
a “stop” only, in a common English pronunciation of this word; 
the replacement of this by, say, n or by the stop of p converts the 
word into other words (ant, apt), p and k without plosion are 
thus "speech-sounds” in this phonetic context. 

Representation of Gliding Sounds and Compound Sounds 

14. In some cases a gliding sound is most conveniently repre¬ 
sented in transcription by a single letter. Thus the initial gliding 
sounds in the English words wet and yard are appropriately written 
with w and j. The gliding portion of an "imperfect diphthong" 6 
may also be written with a single letter. Such a sound is that 
heard in one pronunciation of the French pa:j ( paille). 

15. The gliding sounds called diphthongs,* being sounds which 
begin at one vowel and immediately move in the direction of 
another, are best represented by digraphs of which the first letter 
indicates the starting point and the second the direction of move¬ 
ment. 

16. Compound sounds may be considered in two categories: 

(1) those in which a constituent part resembles a "simple” sound, and 

(2) those in which constituent parts have no close resemblance to 
"simple” sounds. The chief examples of the first category are the 
affricates. An affricate has a “stop” and a “plosion" which 

6 For ‘‘imperfect diphthongs.” see my Outline of English Phonetics, eighth 
and subsequent editions, §227. 

* Perfect diphthongs. 
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terminates with friction reminiscent of the corresponding fricative 
consonant; it may have an aspiration in addition to this. Affricates 
may be represented either by single letters or by digraphs; some¬ 
times one method is the more convenient and sometimes the other. 
Thus I find it generally advisable, in the interests of symbol economy, 
to write the English affricate commonly written ch with a digraph; 
tj is chosen for this purpose since the on-glide and stop of this 
affricate resemble those of the English t, and the friction heard 
when the stop is released resembles the fricative sound J. 7 Similarly, 
it is usual in phonetic transcriptions of German, to employ the 
digraph ts to represent the affricate in tse:n ( zehn ), etc. On the 
other hand, it is more convenient to represent the affricate sounds 
of such Indian languages as Hindi and Bengali by the single 
letters c and j (in Romanic Orthography c and j)* The same 
system is probably also advisable in transcribing Turkish, where 
for instance acan (the one who opens) has to be distinguished from 
atjan (thirsty); the possible alternative system of writing atjan and 
at-Jan respectively is probably not sufficiently distinctive. 

17. Other examples of compound sounds of the first category 
are the mb, nd, etc., of Sinhalese and of Bantu languages, for which 
see §§260 ff.; they are adequately represented by a letter representing 
the nasal part followed by a letter representing the plosive part. 
Yet others are the consonants which have double articulation and 
where one of the articulations terminates before the other. These 
can be written with digraphs, whenever the use of a single letter 
would be ambiguous or inconvenient. Examples are plosives 
pronounced with simultaneous glottal stop (p\ t’, etc.) and clicks 
pronounced simultaneously with rj or g. In writing the former 
the ’ can be dropped if the other p, t, etc., occurring in the language 
are aspirated and therefore expressible by ph, th, etc., which appears 


7 In the rare cases where the phoneme t is followed by the phoneme J in 
English, a hyphen must be inserted to make the transcription clear: e.g. 
'hat-fop (hat-shop), 'koit-Jip (courtship). Such instances are only found in 
compound words or words formed with a suffix. 

• The Indian t and d are very dental, and it is, therefore, unsuitable to 
employ their symbols to denote the first part of these affricates. Moreover, 
when the sequence dental t + J occurs, the plosion of the dental t is heard 
and gives to the sequence an effect entirely different from the affricate c: 
thus the Bengali words kutjit (ugly) and kucit (rarely) are quite distinct. 
The group tC is also distinct from doubled C. 
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to be generally if not always the case. 9 For the latter I recommend 
the notation qi, gb, etc. 10 

18. The second category of compound sounds comprises the 
ordinary (non-affrieated) plosives, such as p, b, t. These, when 
fully pronounced, consist of a "stop” and a “plosion” followed 
very often by an “aspiration.” There is as a rule no reason for 
writing them with more than single letters. 

19. Those plosive consonants which are made by double articula¬ 
tion and in which the two articulations terminate simultaneously 
may, however, with advantage be written with digraphs. Thus 
it is justifiable and advantageous to write the velar-labial plosive 
of the Efik language of Nigeria with the digraph kp, for though 
ordinary k and p also occur in the language, p never follows k 
in any word. 11 


Concrete and Abstract Sounds 

20. There are “concrete” sounds and “abstract" sounds. A 
concrete sound is a physical thing, a sound actually uttered on a 
particular occasion. When we speak of hearing a sound or making 
a sound, the reference is generally to concrete sounds. An abstract 
sound may be said to be that which is common to, or can be 
abstracted from, a number of utterances of what we call “the 


• E.g. in Tswana, where for instance the p of pola (to worry) is commonly 
pronounced with simultaneous glottal stop, and is distinguished from ph5la 
(to jump off). The Georgian language contains similar pairs of consonants. 
The aspirations of the aspirated p, t, etc., are, however, weaker than those 
of Tswana. It is perhaps preferable in transcribing Georgian to write 
p, t, etc., for the aspirated consonants and p\ t’, etc., for the consonants 
accompanied by glottal stop. See Le Maitre Phonitique, January, 1944, p. 5. 
For the unusual case of Korean, see §194. 

10 C. M. Doke has invented special letters for all these click combinations, 
and has used them in his Phonetics of Zulu and elsewhere. D. M. Beach 
has also invented special letters fof the clicks pronounced simultaneously with 
o; see his Phonetics of the Hottentot Language. I venture to think that the 
introduction of these new letters is unnecessary, and that the notation gj, gb, 
etc., satisfies all requirements and is justifiable on grounds of symbol economy. 
When a nasal is followed by a click—combinations which are found in Xhosa 
side by side with the "single sounds” gj, gb, etc.—the letter n may be used; 
thus nj, nb. 

11 See I. C. Ward, Phonetic and Tonal Structure of Efik, p. 7. 
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same sound.” 12 Abstract sounds can be pictured to oneself 18 ; 
concrete sounds are the physical manifestations of abstract sounds. 
When the term "sound" is used in such a book as the present one, 
the context shows whether a concrete or an abstract sound is 
meant. When one talks or writes about sounds, the reference is 
generally to "abstract sounds of the first degree.” 14 


CHAPTER II 

THE PHONEME 

Need for the Phoneme Idea 

21. It is constantly found in language study that several 
distinct sounds (§20) in a language have to be considered as if 
they were one for orthographic, grammatical and semantic purposes. 
Thus, although the English k’s in ki:p {keep), ko:l [call) and ku:l 
{cool) are distinct sounds, it is necessary for practical linguistic 
purposes to treat them as if they were one and the same. 

22. In fact, many of the elements of language commonly 
termed “sounds” or “essential sounds” are in reality small families 
of sounds, each family consisting of an important sound of the 
language together with other related sounds which, so to speak, 
"represent” it in particular sequences or under particular conditions 
of length or stress or intonation. 

"Definition” of a Phoneme 

23. It is to such a family that the term “phoneme” is applied 
in this book. The sounds included in it may be termed its 
“members” or “allophones.” 15 

13 This is no definition, since the idea of the abstract sound is implicit in 
the expression “the same sound.” It has to be assumed that readers under¬ 
stand what is meant (see Preface). 

13 Though I have been told by a psychologist that not everyone can do this. 

14 See further my paper on Concrete and Abstract Sounds in the Proceedings 
of the Third International Congress of Phonetic Sciences. Ghent, 1938. 

16 The convenient term "allophone" was invented by the late American 
linguistician Benjamin Lee Whorf about 1934. Members of a phoneme 
have also been termed "conditioned variants.” They are also said to be in 
"complementary distribution.” 
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24. When a phoneme comprises more than one member, it 
generally happens that one of the sounds seems more important 
than the other(s). This may be because it is commoner than the 
other(s), or because it is the one used in isolation, or because it is 
intermediate between extreme members. Such a sound may be 
termed the “principal member” or “norm” of the phoneme. The 
other sounds in the same phoneme may be called “subsidiary 
members” or “subsidiary allophones.” 1 Subsidiary members have 
also been termed “divergents” and “sub-phonemic variants.” 

25. Before proceeding further it is necessary to give as precise 
an explanation as may be feasible of what a phoneme is. I say 
an “explanation” rather than a “definition,” because in spite of 
the simple appearance of the example in §21, a little consideration 
shows us that an “exact” definition is impossible. We cannot 
define what a phoneme is without making use of terms such as 
“language,” “speech-sounds” and "words,” all of which are in¬ 
capable of definition like the fundamental concepts in other sciences 
(see Preface and §7). 2 We can restrict to some extent the meanings 
of these terms, but when the restrictions have been formulated, 
we still have to assume that the things for which the terms stand 
exist and that people know what they are. 

26. Some have expressed the opinion that phonemes can be 
“defined” simply by reference to their function as units suitable 
for alphabetic expression. F. S. Wingfield, for instance, whose 
main interest in linguistic matters lies in the reform of English 
spelling, has suggested that a phoneme may be defined as "a group 
of speech-sounds nearly enough alike to be treated as a unit for 
alphabetic purposes.” 3 Such a rough-and-ready definition may 
perhaps sometimes serve in a popular exposition; but though it 
underlines one of the chief uses of the phoneme theory, it will 
not apply in all cases, and will not, I think, be found sufficiently 
precise for use in those detailed linguistic investigations which 


1 It is, however, maintained by some that as all members of a phoneme 
are essential, no one member can properly be described as more important 
than another. 

2 See also my paper Some Thoughts on the Phoneme in the Transactions of 
the Philological Society, 1944, pp. 119, 120. 

3 In his Bulletin of reformed spelling, February, 1940, p. 5. 
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need the theory. In the following paragraphs, therefore, a general 
explanation is given of the nature of the units which we call 
phonemes. This will then be exemplified and amplified in the 
course of the book. 

27. In this explanation special senses are to be attached to the 
terms "language” and “phonetic context,” as shown in the following 
paragraphs. 

28. A "language” is to be taken to mean the speech of one 
individual pronouncing in a definite and consistent style (i.e. an 
idiolect). It does not include sounds used in different styles of 
speaking. The style referred to, in the absence of indication to 
the contrary, is a consistent use of the forms of natural, unstudied 
but reasonably careful and not rapid conversation. It is the style 
termed by Paul Passy "prononciation familiere ralentie.” As no 
two people speak exactly alike, the names of languages ("English,” 
“French,” etc., or “Southern British English,” “Parisian French,” 
etc.) refer throughout the book to the speech of imaginary people 
using “slow conversational pronunciation” in what appears to be 
a typical or “average” manner. "German,” in the absence of any 
indication to the contrary, means stage German as described in 
Viktor’s Deutsches Ausspracheworterbuch and other works by the 
same author, and in the new (1957) edition of Siebs’ Deutsche 
Hochsprache. 

29. The reason for restricting the term “language” in this 
manner is that the more general use of the term covers the speech 
of people who speak in ways differing considerably from each 
other—who use different sounds, or who use sounds in such a way 
that a different phonemic classification is called for. 4 See further, 
§658. 

30. The term “phonetic context” of a sound is to be understood 
to mean the sounds next to it or near it in the sequence of which 
it is a part, together with its duration (length), stress and (if voiced) 
voice-pitch. "In the same phonetic context” means "when 
surrounded by the same sounds and subject to the same conditions 
as regards duration, stress and voice-pitch.” For the sake of 


4 For instance, some Frenchmen have two a-phonemes and others only one. 
But they speak the same "language” as the term is commonly understood. 
See §§621-623, also Some Thoughts on the Phoneme, p. 125. 
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brevity it is sometimes convenient to use the term “controlling 
principles” to denote the phonetic contexts which condition the 
use of special members of phonemes. 

31. With these uses of terms in mind, and regarding the phoneme 
as a family, we may say that a phoneme is a family of sounds 
IN A GIVEN LANGUAGE WHICH ARE RELATED IN CHARACTER AND 
ARE USED IN SUCH A WAY THAT NO ONE MEMBER EVER OCCURS IN 
A WORD IN THE SAME PHONETIC CONTEXT AS ANY OTHER MEMBER. 

32. The restriction "related in character" is of necessity vague, 
since it is impossible to specify what degree of dissimilarity will 
prevent two sounds from belonging to a single phoneme. Many 
examples could be adduced to illustrate the difficulty. It does 
not seem possible, for instance, to prove beyond question that h 
and rj belong to separate phonemes in English (see §53). Yet 
they are so remote from each other that it would obviously be 
absurd to consider them as members of one phoneme. On the 
other hand, though voiceless 1 and voiced I axe acoustically very 
dissimilar, they clearly belong to a single phoneme in French 
(§73); and in German we find ourselves obliged to group into one 
phoneme sounds so different as e: and an o-like vowel (§91). 

33. The relationship between sounds in a phoneme may be 
acoustic or organic. For instance, in the case of t and the glottal 
stop (?), which probably belong to one phoneme in some types of 
English (§§618, 620), the relationship is acoustic; to the ear they 
have a certain similarity in some situations in spite of the great 
difference in their manner of formation. On the other hand, the 
acoustically different sounds J and 1 belong to the same phoneme 
in French by virtue of their organic relationship. 

34. The restriction "in a word” is important. To extend the 
definition to cover word-groups or sentences would greatly com¬ 
plicate matters. At the best it would increase the number of 
phonemes in some languages 5 ; it might even be found to render 
the elaboration of any consistent theory of phonemes impossible, 
since variations of sound at word junctions may take so many 
forms. 


4 For an example see §282. See also Some Thoughts on the Phoneme, pp. 127- 
132. See also my Outline of English Phonetics, 8th edition, 1966, and subse¬ 
quent editions, Chap. XXXII, especially §1097. 
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35. It is to be taken as axiomatic that one sound cannot belong 
to two phonemes of a language. There are possibly some rare 
exceptions to this; they are dealt with in Chapters XIX and XX. 

36. From the above explanation it will be seen that the three 
different k’s in the English words mentioned in §21 are classed 
together as belonging to the same phoneme, because the variety 
of k used in ki:p is never used in English before o: or u:, and the 
k of ku:l is never used before i: or o:, and so on. 

37. Sometimes a phoneme consists of only one member, or 
more accurately, of sounds, which are so nearly alike that they may 
for practical purposes be regarded as indistinguishable. We may 
say, for instance, that the English consonant phoneme f comprises 
only one member. 6 Likewise the French vowel phoneme o. H. E. 
Palmer suggested the term “monophone” for a phoneme com¬ 
prising only one sound. 

38. It must be mentioned here that in the opinion of some 
the phoneme may be defined in a totally different manner, namely, 
not as family of sounds but as a single entity-—an abstract sound 
of the second degree 7 which appears in the first degree of abstraction 
as a set of abstract sounds, each of which we are able to bring 
into manifestation as a concrete sound. The question of regarding 
the phoneme in this light and in other ways is discussed in Chapter 
XXIX. See also Appendix I. 

39. The more objective course of regarding the phoneme as a 
family of sounds is followed in this book because I have found it 
to give valuable results in practical language teaching and in 
connexion with the construction of systems of alphabetic writing. 
This manner of regarding the phoneme also presents the advantage 
that in certain cases of difficulty it is possible to select the phoneme 
to which a given sound shall belong. Some examples of this are 
noted in Chapters XIV and XX. 

40. Neither the explanation given in §31 nor any other “defini¬ 
tion” of a phoneme I have ever heard of is completely unassailable. 


• Actually it is possible to perceive slight differences between the f’s used 
in different phonetic contexts. 

7 Concrete and Abstract Sounds, p. 4. 
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I do not think it possible to devise an explanation which leaves no 
loopholes for exceptions. The above, however, gives us a workable 
theory which, as this book will show, is applicable in all ordinary 
cases as well as to many cases of an exceptional nature. 

CHAPTER III 

PHONETIC TRANSCRIPTION 

41. Any unambiguous manner of representing pronunciation 
by means of writing is called a "phonetic transcription.” An 
alphabetic system of phonetic transcription consists of letters 
representing sound-qualities (tambers 1 ) or phonemes, together with 
marks or other devices 8 to indicate such of the other attributes 
of speech (especially duration, stress and voice-pitch) as may be 
of importance in the language to be written. 

42. As the letters of a phonetic transcription may represent 
either speech-sounds or phonemes, so also the other marks and 
devices may represent duration, etc., in great detail, or they may 
be reduced to a minimum and show only just so much as is needed 
to distinguish graphically the words which are distinguished in 
speech by these attributes. 

43. A phonetic transcription in which each letter represents a 
phoneme is called a “phonemic” or "linguistically broad” transcrip¬ 
tion. When duration, stress and voice-pitch are denoted by the 
minimum of marks or devices which will serve to distinguish one 
word from another (in languages where these attributes are used 
for word distinctions), the representation of these attributes is 
likewise said to be "broad.” 

44. A phonetic transcription in which separate letters or special 
marks are used to distinguish two or more members of a phoneme, 
or in which some letters are used to represent phonemes and other 
letters or special marks are introduced to indicate particular members 
of phonemes, is called a "linguistically narrow transcription.” If 
use is made of more marks or devices indicating duration, stress 


1 This convenient anglicisation of the French timbre was invented by 
Robert Bridges, the Poet Laureate. 

* E.g. doubling a letter to indicate length. 
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or voice-pitch than are necessary to make visual distinction between 
words distinguished in speech by these attributes, the representation 
of them is likewise said to be “narrow.” 

46. Use is made of both narrow and broad transciiptions in 
the course of this book. Narrow transcriptions are enclosed in 
square brackets [ ], except where it is specifically stated or is evident 
from the context that the transcriptions are narrow.* These 
brackets are to be taken to mean that the transcriptions are narrow 
in at least one particular; they may be narrow in several particulars. 
The context shows in what respects they are narrow. 4 

46. All transcriptions in this book which are not enclosed in 
square brackets, and which context does not show to be narrow, 
are broad (i.e. phonemic). 

46a. A more detailed account of the theory of broad and narrow 
transcription will be found in Appendix A in the 8th edition (1956) 
and subsequent editions of my Outline of English Phonetics (published 
by Heffer, Cambridge). 

47. The phonetic alphabet used in this book is that of the 
Association Phonetique Internationale. It is assumed that readers 
are familiar with this alphabet. The values of the letters and 
the methods of using them are explained in many works, and 
particularly in the Principles of the International Phonetic Association 
and in Jones and Dahl, Fundamentals de Escritura Fonetica. A 
list of the symbols used in this book is given on pp. xiv-xviii. 

CHAPTER IV 

THE SEMANTIC FUNCTION OF PHONEMES 

48. In §31 it was explained that no one member of a phoneme 
ever occurs in the same phonetic setting as any other member. 
In other words the different members of a phoneme are mutually 
exclusive in the phonetic contexts in which they are used. It is 
important to observe that this exclusion of one member of a phoneme 
from a position appropriate to another member of the same phoneme 
is inherent in the nature of a phoneme. 

• E g. in Chapters XIV, XVIII and in §§119, 323 ff„ 530, 531. 

4 Square brackets are also used for another purpose, namely to enclose 
graphical representations of intonation. This use of square brackets cannot 
be confused with their use as indicating narrowness of transcriDtion. 
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49. It follows from this that if two sounds of a language can 
occur in the same phonetic context, those sounds must belong to 
separate phonemes in that language. Short i and a, for instance, 
belong to separate phonemes in English because they can both 
occur initially before the same consonant, as in the words i'lu^n 
[illusion), s'lu^n [allusion). But in the Efik language of Nigeria, 
which also contains these sounds, a is never used in the same 
phonetic context as i (§119); the same two sounds mutually exclude 
each other, and that is why they belong to the same phoneme in 
that language. 

50. It is therefore a corollary to the explanation given in §31 
that phonemes have a semantic function in languages. For since 
a member of a phoneme can occupy the same situation as the 
appropriate member of another phoneme, it is possible to alter a 
sequence by exchanging a sound for the appropriate sound of another 
phoneme. Such an alteration may change a word into another word. 
In other terms the differences between phonemes are “significant,” 
i.e. capable of distinguishing one word from another. 

51. Conversely, the difference between two members of the same 
phoneme cannot be "significant”: it cannot be made use of for the 
purpose of distinguishing words. This is because, if the explanation 
in §31 is accepted, two members of the same phoneme never occur 
in the same phonetic context, and therefore a sequence cannot be 
changed into another possible sequence of the language by an 
interchange of two such sounds. 

52. ' The fact that sounds belonging to separate phonemes are 
capable of distinguishing words, while sounds belonging to the 
same phoneme are not, is thus not part of any “definition ’ of a 
phoneme as here conceived. It follows from our explanation of 
its nature. 

53. The sounds of separate phonemes do not necessarily distin¬ 
guish words, but they are capable of doing so, and generally do so. 
It occasionally happens, however, that no pair of words can be 
found in which the sole difference lies in the substitution of one 
particular phoneme for another. Thus, as far as I know, there is 
no pair of English words differing by the substitution of h for g. 
It would, however, clearly be inexpedient to assign the two sounds 
to a single phoneme on that ground; they are not sufficiently nearly 
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“related in character.” The same appears to apply to rj and 3 
in English (see §152) A 


CHAPTER V 

OTHER SIGNIFICANT ELEMENTS 

54. It is the addition, subtraction or interchange of phonemes 
(as here conceived) which constitutes by far the commonest means 
of distinguishing words. There exist, however, other means of 
effecting word distinctions, such as differences in the duration of 
sounds, differences in the stress of syllables and differences of voice- 
pitch. It appears that there are few, if any, languages which do not 
make some use of at least one of these. These attributes are 
discussed in detail in Chapters XXI-XXV. 

55. Words may also be, and very often are, distinguished by 
complexes of the various attributes of sounds. These complexes, 
which are very numerous, are discussed in detail in Chapter XXVI. 


CHAPTER VI 

MINIMAL DISTINCTIONS 

56. When a distinction between two sequences occurring in a 
language is such that any lesser degree of distinction would be 
inadequate for clearly differentiating words in that language, the 
distinction is termed a "minimal” one. 

57. Minimal distinctions are very commonly effected by the 
addition or subtraction of a phoneme. Thus minimal distinctions 
are shown in the following pairs of English words kwout {quote), 
'kwouta {quota)-, lo: {law), lo:n {lawn); trei {tray), strei (stray); 
wet (wet), west (west). 


1 I would add here a word on "oppositions.” The different phonemes of 
a language are naturally “opposed” to each other in the sense that one 
phoneme can, under suitable conditions, supplant another, the substitution 
causing a word to be changed into another word. A comprehensive theory 
of oppositions has been worked out by various phonologists. For this see 
particularly Trubetzkoy, Grundziige der Phonologie, especially pp. 59-75 (in 
Cantxneau’s translation, pp. 69-87). 
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58. Minimal distinctions may also be effected by the substitution 
of one phoneme for another. Thus, the following pairs of English 
words exhibit minimal distinctions: sit, sat; sit, sip; sit, fit. 

59. In “chrone languages” (§381) minimal distinctions are found 
when a short sound is replaced by the same sound lengthened. 
Thus the French pair bet ( bette ), be:t {bite) exhibits a minimal 
distinction. So also does the stage pronunciation of the German 
words 'trene ( trenne ) and 'treine [Trane), and pairs like taiko (drum), 
taiko: (opposition) in Japanese, are (he has) and a: re (daughters) in 
Kikuyu, okutuma [..■.] (to send) and okutuima [..-.] (to heap 
up) in Luganda. 1 

60. In "stress languages” (§428) minimal distinctions are found 
if strong stress is shifted from one syllable to another, the sounds 
and lengths remaining constant or approximately so. For instance, 
the Spanish 'termino {termino, end), ter'mino ( termino , I finish) 
and termi'no ( termino , he finished) differ from each other by minimal 
distinctions. 2 

61. In "tone languages” (§470) minimal distinctions are found 
when two words differ in tone but not in any other respect. Thus, 
the Cantonese words s sceg (hurt), ^scet) (photograph), "sceg (prime 
minister), s soeg (pair of drawers), / soei] (ascend), _soeg (above) differ 
from each other by minimal distinctions. So do pairs of Tswana 
words such as moa (soot), moa (breath, spirit), kkau (young man), 
likau (kind of wild duck), bontle (beauty), bontlt (the outside). 8 

62. If two sequences are distinguished by simultaneous changes 
in two attributes, the distinction is a minimal one when the amount 
of change in each attribute is not of itself sufficient to differentiate 
words in the language in question. This case may be exemplified 
by such pairs of English words as Ji:p {sheep), Jip {ship), di:d {deed), 
did {did). The vowels of such words in my type of English differ 

1 Dots and lines enclosed in square brackets indicate the tones in tone 
languages. Dots show short syllables; horizontal lines show long syllables. 
(Oblique lines may represent short syllables as well as long ones.) 

2 The shifting of secondary stress does not appear to be adequate for 
differentiating one word from another. (See footnote 25 to §462.) 

3 Narrow transcriptions of these Tswana words, when said in isolation, 
are mo:a, mora, llkaiu, llkaiu, bonitlg, bonitli. The type of Tswana 
referred to here and elsewhere in this book is the Rolong dialect spoken in 
and around Mafeking. 
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both in length and in quality: the vowel in Ji:p is longer than that 
in Jip, and it also has a "closer” quality. 4 

63. Other examples of minimal distinctions by a combination 
of quality and length are those in such English pairs as no:d (gnawed) 
and nod (nod), pu:l (pool) and pul (pull). Minimal distinctions by 
a similar combination are found in German, but in that language 
the quality difference is notably less than in English, and the length 
difference correspondingly greater. Examples are 1 biite (biete), 
‘bite (bitte), zi :? 5 (Sieg), zig (sich). 

64. Minimal distinctions are sometimes due to changes in three 
attributes simultaneously. In Serbo-Croat, for instance, minimal 
distinctions may be effected by a combination of length, stress and 
voice-pitch (see §§466, 547, 558-560). In Pekingese, too, we find 
that the difference between the first (high-level) and third (low- 
rising) tones is accompanied by a difference of vowel quality and of 
length (see §§543, 557). 

65. Sequences are, of course, very often distinguished by more 
than a minimal difference. These wider differences may be termed 
"duple,” "triple,” etc., according to the number of minimal distinc¬ 
tions of which the total difference is composed. Duple distinctions 
are those which are the result of two minimal distinctions. These 
are exemplified in such English pairs as sit and sap, sit and lip, 
which each differ in two phonemes. Further examples are seen 
in the French pair bc:t (bite) and bek (bee), which exhibits a difference 
in a phoneme and a significant length, the English pair ni:d (need) 
and nod (nod), which exhibits a difference of quality and duration 
equivalent to two minimal distinctions, 6 the English 'proutest 
(Protest, n.) and pra'test (protest, v.), which differs in a phoneme 
and the incidence of stress, the Cantonese s soet) (hurt) and s seg 
(city) which differ in a phoneme and a tone. 


4 This does not apply to Scottish pronunciation. In Scottish English 
the vowels generally have about the same length (short), and the distinction 
between such words as sheep and ship is rendered adequate by a wider quality 

difference than that found in my type of Southern English. 

6 Alternative pronunciation of zilk. 

6 ni:d (need) and no:d ( gnawed ) are distinguished by a single phoneme, 
while no:d and nod are distinguished by a minimal difference consisting of a 
combination of quality with duration. 
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66 . Triple distinctions are those shown in the English pairs 
fit and pan (three phonemes), hi:t [heat) and' pot (two phonemes 
and a complex of quality and length), 7 the French pair bs:t [bite) 
and Jsk ( cheque) (two phonemes and a significant length), the 
English pair 'preznt ( present , n.) and pri'zent ( present , v.) (two 
phonemes and the incidence of stress), the Cantonese pair 'saig 
(provincial) and N seg (city) (a phoneme, a length and a tone), the 
Kikuyu pair ra:ra [- .] (spend the night) and rara [• / ] (be tepid) 
(a length and two tones), and so on. 


CHAPTER VII 

EXAMPLES OF PHONEMIC GROUPING 

67. A few examples of the grouping of sounds into phonemes 
have already been given, and notably the English k-sounds (§§21, 
36). In the present chapter are shown other examples of various 
types. They are intended to give the reader a general idea of the 
principles involved. 

68 . It will be observed that all illustrations of the phoneme 
require a comparison of pairs of sequences differing in two respects. 
In most cases one of the differences is evidently the chief means of 
distinguishing the sequences, and the other an incidental one 
accompanying or conditioned by the chief means of distinction. 
Thus, the English words kill, ko:l, ku:l are distinguished mainly 
by their vowels. The k’s are also different, but these differences 
are clearly subsidiary and conditioned by the vowels. 

(1) Variants of Sound Conditioned by Neighbouring 
Sounds in the Sequence 

(a) Consonants 

69. In English the h-sounds in hed [head), hard [hard), hud 
{hood) are very different sounds, but they have to be considered 
as belonging to a single phoneme, since their use is dependent 
upon the nature of the following vowel. The words are distin¬ 
guished by the vowels, and the differences in the h’s are evidently 
incidental to these. 


7 The number of minimal distinctions is shown by sequences like hilt, 
pi:t [peat), pit, pot, or hi:t, pi:t, po:t (port), pot, or hi:t, hit, hot, pot. 
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70. The g-sounds in sir) (sing) and sot] (song) are different 
sounds, as can easily be heard by isolating them; but these and 
other varieties of r) are used in accordance with the nature of the 
preceding vowel and must therefore all be regarded as belonging 
to a single phoneme. Here again it is the vowels which constitute 
the essential differences between the words, the variations of q 
being incidental. 

71. In Southern English, as I speak it, the l-sounds in li:f (leaf) 
and fill (feel) are different, the first being a moderately "clear” I 
and the second a “dark” (velarized) I. But these two I’s, and 
other I’s of intermediate types, must be looked upon as belonging 
to one phoneme, since their use is entirely conditioned by their 
situations in relation to other sounds in the sequence. (Clear I’s 
are used whenever a vowel follows. 1 Dark I is used finally and 
whenever a consonant follows. Dark I does not occur before a 
vowel, 2 and clear I never occurs finally or followed by a consonant.) 

72. The English t-phoneme is one which includes a number of 
members. The principal t is alveolar and slightly aspirated; it 
is used before vowels in strongly stressed position, as in ten (ten), 
'teibl (table). Subsidiary t’s are: (1) a dental t used before 0, as 
in eit0 (eighth), (2) a retracted t used in tr, as in trein (train), (3) 
an unaspirated or barely aspirated t used in weakly stressed 
positions, as in Meta (letter), (4) a laterally exploded t used in tl, 
as in 'ketl (kettle), (5) a nasally exploded t used in tn, as in 'mAtn 
(mutton). 

73. In French, voiceless I (J), which is used in such words as 
pcepl (peuple), 5:kl (oncle) when final, must be considered as belonging 
to the same phoneme as the ordinary French I. Its use is entirely 
dependent upon situation; for instance, in French the voiceless 
sound J can never begin a word as the ordinary I can. 

74. In French different kinds of d are used in different situations. 
For example, the d of adjektif (adjectif) is an alveolar sound distinct 
from the common French dental d, as in dut (doute). 

75. In many languages, e.g. Italian, Hungarian, the sound q 
is found to the exclusion of n before k and g, but not in any other 

1 There are degrees of '‘clearness,” according to the nature of the vowel. 

2 Except occasionally when it is syllabic, i.e. pronounced with unusual 
length. 
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situation. In Spanish, g is found to the exclusion of n before k, 
g and x, as in [‘0igko ] 3 (cinco, five), [ig'gles] ( ingles, English), 
[na'ragxa] ( naranja , orange), but not in any other situation, g is 
therefore to be considered as belonging to the same phoneme as n 
in these languages. (Broad transcription of the above Spanish 
words would be ' 0 inko, in'gles, na'ranxa.) 

76. In conversational Dutch the sound g exists, but only before 
voiced consonants, e.g. before d in ['zagduk] ( zakdoek, handkerchief), 
where it may be said to “represent” k. g is therefore a member 
of the k-phoneme in that language. 

77. The sounds h, 5 and $ occur in Japanese, as in [giro] (man), 
[hana] (nose), [<|>uku] (luck). They belong, however, to one phoneme, 
their use depending entirely upon the nature of the following 
vowel, h-sounds are used before e, a and o and not before i and u; 
g is used before i and not before any other vowel; <j> is used before u 
and not before any other vowel. (The above words would appear 
in broad transcription as hito, hana, huku, and these are their 
forms in Kunreisiki Romazi orthography.) 

78. In languages containing sounds of the h type varieties of 
"voiced h” (fi) are often found to occur between voiced sounds 
to the exclusion of breathed h-sounds, while breathed h-sounds are 
used in all other situations to the exclusion of voiced h-sounds. 
When this is so, the voiced and breathed h-sounds constitute one 
phoneme. This is found to be the case, for instance, in some styles 
of English pronunciation. Many English people, especially when 
speaking in a rather rapid style, use voiced h in such words as 
sd'hia (adhere), 'boihud (boyhood), bi'haind (behind), though 
they use breathed h initially or when a voiceless consonant 
precedes. 

79. In one form of the Kikuyu language of Kenya the sounds 
<j), (3 and b appear to be used in such a manner that they may 
be regarded as constituting a single phoneme. I refer to the 
speech of those natives who use <|> initially and not elsewhere, 
e.g. in [<J>a:(5a] [-^] (father), (3 medially except after m, e.g. in 
[Yeka(3Q] [. • •] (basket), and plosive b after m and not elsewhere, 


3 Square brackets [ ] mean that the word is transcribed narrowly (§45). 
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e.g. in [mbori] [. •] (goat), [ituimbe] [•-•] (egg). 1 * * 4 These words 
would accordingly be written broadly as ba:ba, gekabu, mbori, 
ituimbe. Similar considerations doubtless apply to the families 
0 , 8, d, as in [0a:5I] [-•] (soup), [ando] [••] (people), and x, y, g in 
Kikuyu, but I have not had opportunities of verifying this. [0a:8I] 
would be written broadly as daidl. 5 * 

80. In one variety of the Igbo language of Nigeria J and s 
belong to a single phoneme, J being used to the exclusion of s 
before i and e, and s being used to the exclusion of J before all 
other vowels. Examples are ojiri [ ‘ . .] (he cooked), ojere [ ’ . .] (he 
said), but ostre [ ‘..] (he wrote), osara [ ‘..] (he washed). In broad 
transcription the first two of these would be written osiri, osere. 

81. In the Tswana (Chwana) language of South Africa there 
exists a peculiar sound intermediate between d and 1, which may be 
written narrowly with the letter J. 8 It occurs before i and u but not 
in any other situations. There is also an ordinary I in the language 
which never occurs before i or u, but does occur before all the other 
vowels (i, e, e, a, o, o, o). Thus, xollmo (above), molurro (sound), 
have the d-like I, while Ilia (to weep), molimo (goodness), tjhob 
(hospitality) have ordinary I. It follows that J must be regarded 
as belonging to the l-phoneme in Tswana. 7 

82. I am informed that in certain dialects of Igbo (Nigeria) a 
sound intermediate between I and r occurs before the vowels i, e, 
q, u and not in any other situations, while an ordinary single flap 


1 Not all native speakers of Kikuyu pronounce in this way. Many do not 
use (j) at all, but use (1 in initial as well as in medial positions. This is the 
form described in L. E. Armstrong’s The Phonetic and Tonal Structure of 

Kikuyu. Some, influenced by English, use an English b in borrowed words 
such as buiku [-'] (book). With them b and (3 are separate phonemes. 

(The natural unstudied pronunciation of the word for “book” by speakers 

unaffected by the importation of English sounds is mbuiku [- ■].) _ 

5 Miss Armstrong suggested a different phonemic classification, which 
I am disposed to think is better than that given above. She regarded the 
Kikuyu mb, nd and gg as single compound sounds constituting separate 
phonemes. This treatment of the sounds is clearly justifiable in this language, 
since when mb, etc., occur initially, the nasal element is very short and 
often so weak as to be almost imperceptible; it may even be absent altogether. 
(See Armstrong, pp. 30-32.) 

8 As in the Sechuana Reader by D. Jones and S. T. Plaatje. 

7 For J as a phoneme separate from I see §127. 
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r occurs before the other vowels (e, a, a and o) and not in other 
situations. Thus, osiri (he cooked) and asere (he said) 
have the l-like r, while osere [ ’.(he wrote) and asara ['.(he 
washed) have the ordinary flapped r. This distribution requires 
that the two consonants should be regarded as belonging to the 
same phoneme in these dialects. 

83. An r-like and an l-like sound also belong to a single phoneme 
in Luganda. The usage in that language appears to be that the 
sound is l-like initially and after the vowels a, o, u, and r-like after 
i and e. Examples are okutoda (to accept), emmaili (wealth), but 
[okumiira] (to swallow), [omuliro] (fire), [emmeiri] (steamer ). 8 

84. In several Indian languages, e.g. Urdu, Bengali, Telugu, 
Sinhalese, the retroflex consonant q occurs when one of the other 
retroflex sounds t or follows, but not elsewhere. Examples are: 
Urdu [ghaqfa] (bell), [koqcjj] (water carrier); Sinhalese eteqfa 
(thither), q must therefore be considered a member of the 
n-phoneme in these languages (compare §128). 

85. Tamil is a language which illustrates particularly well the 
grouping of several quite distinct sounds into single phonemes. 
The grouping has been worked out by J. R. Firth, and details 
will be found in his Short Outline of Tamil Pronunciation .* One 
example may be quoted here. 

86 . Varieties of k, g, x, 9 and y occur in Tamil and are all 
members of a single phoneme. The rules governing the use of 
the sounds are as follows. (1) Single k occurs only initially, (2) 
double k occurs only intervocalically and in the group rkk, (3) g 
is used after g but not elsewhere, (4) single x is used in most inter¬ 
vocalic positions, but (5) 9 is used intervocalically when the following 
vowel is i, and ( 6 ) a variety of y 10 is used when a or u follows in 
final syllables. Examples given by Firth are: (1) [kal] (stone), 
(2) [pakkam] (side), [tiirkka] (to complete), (3) [naiggal] (we), 
(4) [maxan] or [mayan] (son), (5) [ajugi] (decaying), ( 6 ) [kuruc^aryaJJ 
(blind men), [pirayu] (after). 


• See L. E. Armstrong’s summary of the phonetics of Luganda in Wester- 
mann and Ward’s Practical Phonetics for Students of African Languages, 
p. 191. 

* Published as an appendix to the 1934 edition of Arden’s Grammar of 
Common Tamil. 

10 With weak friction. The sound resembles a voiced h. 
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87. It is noteworthy that Tamil orthography does not show 
any difference between all these sounds. Those who originally 
invented this orthography must have had a clear conception of 
the phoneme idea, though the theory had never been formulated. 
This is evidence in favour of a mentalistic view of the phoneme 
(Chap. XXIX). 

(b) Vowels 

88 . In English, as I pronounce it, certain special varieties of 
vowel sound are used exclusively before “dark” I. For instance, 
the vowels in wet (wet) and wel (well) are different, the second 
being opener than the first. The two must, however, be regarded 
as belonging to a single phoneme since the opener vowel only 
occurs before dark 1 and the closer vowel is never found in that 
position. The opener e may be thought of as “taking the place” 
of the common (closer) e in this particular situation. 

89. Again, in my type of English, the u-sound in 'mjuizik 
(music) is not the same as that in lu:z (lose)-, it is a "fronter” sound. 
The usage is that when long u: is preceded by j, the sound employed 
is fronter than that used in other situations. Therefore, these 
fronter and backer sounds must be regarded as belonging to the 
same phoneme in my type of Southern English. 

90. An example from French is found in the sounds of the o 
type. In Parisian French the vowel of not (note), bon {bonne), 
etc., is a central variety as shown in Figure l. 11 But when r 
follows as in port (porte), the variety employed is a back one. 
The two sounds, being used in different phonetic contexts, naturally 
count as belonging to the same phoneme. 

91. In German there are two very different varieties of a, a 
low one not far from o used when r follows in the same syllable; 
and a high one used in other situations. Examples of words 
containing the low a are ['besar] (besser), ['zondarn] (sondern); 


11 This diagram and subsequent similar ones (Figs. 2, 3, 4, etc.) show 
approximate relations of vowels to a system of “Cardinal Vowels” which can 
be conveniently employed as a basis of comparison. For this method of 
indicating the nature of vowels, see my Outline of English Phonetics, Chap. 
VIII. These cardinal vowels have been recorded on a double-sided gramo¬ 
phone record, numbered ENG 252-3, published by the Linguaphone Institute. 
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examples of words containing the high a are [’bits] ( bitte ), [gs'iaistat] 
(geleistet). As these a’s are consistently used according to the 
presence or absence of a following r in the same syllable, the two 
vowels must be classed as belonging to the same phoneme in this 
type of German. They probably also belong to the same phoneme 
as e: (see §§224, 225). 12 



92. In the Danish language varieties of short a are distin¬ 
guishable according as a dental (alveolar) or a velar consonant 
follows. A fronter variety is used, for example, in san’ {sand, 
sand), 'sada {satte, set), sgal (skal, shall), and a backer one in sag’ 
{sang, song), tak {tak, thanks), slags {slags, sort), sayd (sagt, said). 
A still backer variety of a is used in such a word as var’m {varm, 
hot), where a uvular r follows. These varieties of a are evidently 
conditioned by the following consonant and are therefore to be 
assigned to a single phoneme. 

93. In Arabic there occur several varieties of a-sound ranging 
from a very front and raised a to a very back a. Until quite 
recently it has been thought that their use is prescribed exclusively 

12 There are types of German in which the written r of besser, sondern, 
etc., has no consonantal value at all, but in which the er is pronounced as a 
kind of uvularized vowel resembling 3. This sound constitutes a phoneme 
separate from 9 . See Part 2 of Egan’s German Phonetic Reader, where this 
special vowel is represented by the letter E; besser is written 'bESE, sondern 
'zsndBn, etc. Nur, vier, etc., are likewise written with B (niUE, filE. etc.) 
in transcriptions of this style of speech. See also Egan’s remarks on p. viii 
of his book. 
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by the nature of the adjacent consonants, and in consequence, if 
this opinion is correct, they must all be held to belong to the same 
phoneme. The complicated principles regulating the use of these 
vowel-sounds are set out in Gairdner’s Phonetics of Arabic, 
Chapter VII . 13 

94. The principles of phonemic grouping of vowels are particu¬ 
larly well illustrated in the Russian language, and it is worth while 
setting out the case here in some detail. 

95. As is well known, the consonants of Russian fall into two 
sets, the "soft” consonants (viz. the palatalized consonants, 
together with tj, JtJ, and the separate j) and a set of "hard” (non- 
palatalized or velarized sounds corresponding to the palatalized 
consonants, together with J, 3 and ts). Representing the palatalized 
consonants by digraphs formed with j , 14 the corresponding palatalized 
and non-palatalized sounds may be written tj, dj, sj, zj, etc., and t, 


13 I have recently (1945) been informed that there are some errors in 
Gairdner’s analysis of the a-sounds of Cairene Arabic, and that actually 
there is an d-phoneme as well as an a-phoneme in that form of the language. 
I have had no opportunity of corroborating this. In any case it may not 
apply to other types of Arabic. 

14 In monographic phonetic writing the palatalized consonants would be 

written in international notation by the signs 4. $. ?. etc. To my mind 
the digraphic system is generally preferable as being simpler and more easily 
legible than the monographic. (Some of the monographic signs are un¬ 
satisfactory.) In digraphic representation of the soft consonants a single 
j can be used to show that all the consonants in a sequence of two or more 
are palatalized; thus, strj can be used to mean In groups such as 

vzg-lj, where anon-palatalized consonant is followed by one or more palatalized 
consonants, a hyphen can be inserted after the symbol of the non-palatalized 
sound; in cases where this happens in accordance with one of the regular 
rules, the hyphen may be omitted. (These rules are set out in Chap. V of 
the Manual of Russian Pronunciation by S. C. Boy anus, published by Lund, 
Humphries and Co., 1955.) 

Note that, when j is used to denote palatalization, two j’s must be written 
in such words as pjjut (nBIOT, they drink), 'bjjotji (SBCTe, you strike), 
3 i'ljjo (JKIIJibe, dwelling place). 

Incidentally, it may be observed that Russian orthography could be simp¬ 
lified (in the sense that an alphabet of fewer letters could be devised) by the 
consistent use of a letter denoting palatalization. The letter u would then 
no longer be needed, nor would the letters a, 10 and H. On the other hand, 
the sign of palatalization would occur very frequently, and would render 
the written forms of many words longer than they are at present. Some 
might consider this a drawback. 
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d, s, z, etc. There is no doubt at all that the corresponding 
palatalized and non-palatalized sounds must be considered to 
belong to separate phonemes, since they are as a rule quite distinct 
in sound from each other and the qualities of their essential parts 
are invariable. 15 



Fig. 2. Positions of the chief Russian a-sounds. 

96. It is well known too that all the Russian vowels are subject 
to wide variations. These are conditioned by (1) the nature of 
adjoining consonants, i.e. whether they are soft or hard, and (2) 
the degree of stress. It is, for instance, possible to distinguish 
fairly easily five different shades of a in strongly stressed positions 
and two more in medium and weakly stressed positions. The 
shades occurring in strongly stressed positions may be represented 
narrowly by the symbols a- (a very back a), a (a middle a), a- 
(a not very forward a), a (a completely forward a, near to cardinal a), 
ax or ae (a sound between cardinal a and e). (See Figure 2.) These 
sounds are to be heard in the following words: 

■palka [’pa-lka] (nanna, stick), 
mat [mat] (MaT, checkmate), 
mjat [mja-t ](mht, crumpled), 
matj [matj] (Mam, mother), 
mjatj [mjaxtj] (mhti>, to crumple). 

The principal shades occurring in medium and weakly stressed 


15 Though it must be noted that in some special connexions, when the 
sounds are so situated as to be deprived of their glides, the differences between 
certain hards and softs are not very clearly perceived. (See §183.) 



EXAMPLES OF PHONEMIC GROUPING 27 

positions are heard in xara'Jo [xerA'Jo] (xoporno, well), the first 
syllable of which has weak stress and the second medium stress. 
(See further, §§536, 539.) 

97. Specially noteworthy is the case of the Russian i-phoneme, 
which in strongly stressed syllables comprises members ranging 
from a rather back t to a cardinal i, and which includes in weakly 
stressed positions i and a lowered variety of i (t). 16 The four 
most notable members of this phoneme used in strongly stressed 
positions may be represented narrowly by the symbols t (a sound 
between cardinal i and uu), t- (a retracted variety of this), i+ (half 
way between this + and i), i (a somewhat retracted i), u (cardinal i). 
(See Figure 3.) Words illustrating these sounds are: 

bil [bi-l] (6 uji, was), 

bilj [bt+lj] (Sujit, a true fact), 

bjil [bjil] (6hji, he beats), 

bjilj [bjulj] (Ghjis, parliamentary bill). 

i is heard in such words as i'gra [dgra] (wrpa, game), 'dajtji ['dajtji.] 
(AatiTe, give), and t ( = F) in words like 'komnati ['komnatb-] 
(KOMHaTH, rooms), tsi'na [tst'na] (neHa, price). 



[i+] in bicj 
[*] in vi 

(*-] in bil 

[t] in 'vodi 


Fig. 3. Positions of the chief Russian i-sounds. 


98. The Russian e-phoneme comprises sounds ranging from a 
close e to a very open and somewhat retracted e. It occurs 


16 See §§332-335. Also S. C. Boyanus, The i -t Phoneme. 
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exclusively in strongly stressed positions. The following words 
exemplify the use of the chief varieties: 

vjesj [vjesj] (Becb, all), 
vjek [vje*k] (Beit, century), 

'etat ['eTtat] (btot, this). 

(See further, §§308, 332-336.) 

99. In the case of the o and u phonemes the range of variation 
is less wide, but it is noteworthy that central varieties of o and u 
are used when soft consonants both precede and follow, as in 

'tjotja ['tjotjs] (Tern, aunt), 

•ijudji ['Ijudji] (jnogH, people). 

(2) Variations of Sound Quality Conditioned by 
Duration, Stress and Voice-Pitch 

100. The following are examples of different sound qualities 
which are assignable to the same phoneme on the ground of their 
occurrence in relation to the attributes duration, stress and voice- 
pitch. They illustrate differentiations of sound sequences by 
complexes of attributes, a subject dealt with more fully in Chapter 
XXVI. 

(a) Duration 

101. In languages where vowel length is significant it very 
often happens that the quality of a long vowel is not quite the 
same as that of the corresponding short vowel. If the qualities 
of the long and short sounds do not differ widely, it is clearly advisable 
to regard them as belonging to a single phoneme. 

102. A language in which this principle is well illustrated is 
colloquial Sinhalese. This language contains six vowels, i, e, e, a, o 
and u, to which significant length is applicable, and one, a, which 
only occurs short. I have not detected any quality difference 
between the long and short e, but in the case of the other lengthen- 
able vowels perceptible quality differences accompany the variations 
of length. Examples of words differentiated essentially by the 
length of the vowel are silts (cold), sits (mind), herns (beautiful), 
hens (thunder), kails (quarter), kals (pots), poirs (manure), pors 
(quarrels), suidu (gambling), sudu (white). 
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103. The differences in the qualities of the vowels in such pairs 
of words are not great. Their extent may be judged from the 
annexed diagram (Fig. 4), in which the Sinhalese vowels have been 
placed in relation to cardinal vowels. The diagram shows that 
the differences of length are all-important; Sinhalese words, like 
those quoted in §102, would undoubtedly continue to be adequately 
distinguished if the slight differences of quality were not 
observed. 



Fig. 4. Positions of the Sinhalese Vowels. 


104. There are languages in which much wider differences of 
vowel quality accompany differences of vowel length. One of 
these is English, or more precisely one type of Southern English. 17 
In this kind of English the closer i-sound of such words as deed, 
sheep, aesthetic and the opener sound of did, ship, estate (is'teit), are 
connected by the relation that the first is always longer than the 
second in similar situations: the vowel in deed is longer than that in 
did, the vowel in sheep is longer than that in ship, the first vowel 
in aesthetic is longer than that in estate, and so on. These two 

17 I refer here to the type of Southern English which 1 find it convenient 
to select for practical teaching purposes. Other types of English exist in 
which this relation is not consistently observed. There are, for instance, 
varieties of English in which the vowel in bid is in certain situations fully 
long like that of bead] with those who pronounce in this way the two sounds 
belong to separate phonemes (see §§130, 519, 734, also footnote 8 to §513 
and footnote 1 to §592). The sounds would likewise belong to separate 
phonemes in the speech of any who pronounce aesthetic with the closer i 
but consistently make no difference in length between it and the first vowel 
of such a word as estate. 
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vowel-sounds may therefore be regarded as members of the same 
phoneme in this type of English. Similar considerations apply to the 
English long and short o and u sounds, as in caught, cot, pool, pull. 

105. The same thing is found in German, where the long i, a, o, u, 
y and 0 differ in quality from the corresponding short vowels. 18 

106. Further examples of a similar kind may be found in many 
other languages. In Dutch and in Hungarian, for instance, it 
seems clear that the widely differing sounds a and a should be 
considered as belonging to the same phoneme, a being always 
long and a always shorter under similar conditions (see §§523, 524). 
Sounds of these types are likewise assignable to one phoneme in 
various forms of English; the usage is, however, the reverse of 
that in Dutch and Hungarian, the a-like sound being long and the 
a-like sound ([a*] or [*]) short. 19 

107. In cases where the qualities differ widely, but the accom¬ 
panying differences of length are not great, the sound-qualities may 
have to be classed as belonging to separate phonemes, the lengths 
being considered as incidental. Some teachers think that the 
vowels of Southern English should be treated in this way (see 
§§515, 516). 

(b) Stress 

108. The use of aspirated and unaspirated t-sounds in Southern 
English is connected chiefly with the degree of stress. A t with 
considerable aspiration is used before strongly stressed vowels as 
in ten [ten), a'tend [attend)', a less aspirated or unaspirated t is used 
before weakly stressed vowels as in 'priti [pretty), 'a:fta [after), 
'lets [letter), as already mentioned in §72. The two varieties must 
evidently be regarded as members of the same phoneme. 

109. In American English a "voiced t” (a kind of r-like flap) 
is used in many weakly stressed positions, e.g. in 'bstr [better). 

w e and £ belong to separate phonemes in stage German, since in that type 
of German 'eire and 'sire, for instance, are different words (Ehre, Ahre). 
They belong, however, to one phoneme in the speech of those who do not use 
a long open e: (i.e. those who use close ei in such words as Ahre, spat, Nahe, 
aknlich). (See §§224-229.) 

19 Reference is made here to the types of English in which the front a 
([ax] or [ae]) is short. There are other types in which it may be long (see 
footnote 8 to §613 and elsewhere). 
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As the use of this special sound is confined to these positions, 
and an ordinary t does not occur in them, it is to be regarded as 
belonging to the t-phoneme. 

110. In French a close e is used in final positions, i.e. in syllables 
which receive normal strong stress, while opener varieties of e 
(“e moyen”) are used in non-final positions, i.e. on syllables with 
weaker stress. They must undoubtedly be classed as belonging to 
a single phoneme. (See §§214 ff.) 

111. In Russian a number of shades of vowel-sound are found 
exclusively in weakly stressed positions. They are assignable to 
the same phonemes as those to which strong vowels belong. (See 
§§332, 333, 536-539.) 

(c) Intonation 

112. Certain vowels of Pekingese vary with the “tone” under 
some conditions. For instance, when the syllable which may be 
written fen has the first tone (level), 20 the vowel is a rather a-like 
sound, but when the syllable has the third tone (low-rising), 21 the 
vowel is more A-like. These two vowel-sounds must be regarded 
as belonging to the same phoneme, for, as is shown later (§545)-, 
the tones are the essential distinguishing feature of the words, 
and the vowel qualities are incidental to them. (The third 
Pekingese tone also involves length, as mentioned in §557.) 

113. I have noticed also that the Pekingese syllable which may 

be written broadly foil has values depending upon tone which 
may be shown narrowly as follows [”fou, Tqu, In the 

last of these the u element is almost inaudible. 

114. It would appear too that even in such a language as 
English, vowel-quality sometimes varies to some extent according 
to intonation. Professor K. Luick of Vienna made observations 
tending to show that the Southern English o: [a:] appears in a 
closer variety on high pitches and an opener variety on low pitches. 
He also came to the conclusion that ei is less diphthongal on low 
pitches than on high pitches. 22 

20 Meaning “command” or “a tenth” (among other things). 

21 Meaning “powder” (among other things). 

22 Germanisch-Romanische Monatsschrift, Vol. 18, p. 364, and Englische 
Studien, Vol. 65, p. 337. 
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CHAPTER VIII 

DIFFERENCE OF USAGE IN DIFFERENT LANGUAGES 

115. It will be seen from many of the examples given in the 
last chapter that sounds which belong to separate phonemes in 
one language may belong to the same phoneme in another language. 

116. Thus, we find h and g belonging to different phonemes 
in German, 1 but to the same phoneme in Japanese (§77). Varieties 
of voiceless 1 (written phonetically 1 or 4 ) occur as separate phonemes 
from voiced I in Welsh, Zulu and Burmese, but two such sounds 
belong to the same phoneme in French (§73). g and k belong to 
separate phonemes in most languages, but to the same phoneme in 
Dutch (§76). n and g belong to separate phonemes in many 
languages, e.g. English, German, Chinese, Swahili, Tswana, but 
they are members of the same phoneme in Italian, Spanish, 
Hungarian and Hindi (§75). 6, 3 and d belong to separate phonemes 
in English, but may be found as members of one phoneme in one 
form of Kikuyu (§79). 

117. Further examples are the following. The s-phoneme of 
Spanish, as spoken in Buenos Aires, comprises a number of sounds 
which occur as separate phonemes in other languages. A sound 
of the s-type is used before vowels, but before another consonant 
we find h, varieties of g and x, rp and J, as in 

['pahta] (pasta, paste), 

['lihSto] (listo, ready), 

[’boh x ke] (bosque, a wood), 

[■mihmo] or in quick speech ['mirpmo] (mismo, same), 
[•alio] ( hazlo , do it). 

118. Varieties of a and a belong to separate phonemes in some 
types of English (§129 and. footnote 8 to §513), but similar sounds 
belong to a single phoneme in Russian (§96) and probably in some 
types of Arabic (§93). 

119. In the Efik language of Nigeria there exists an i (similar 
to the English vowel in eat) and an a (similar to the sound of a 
in along). In that language these sounds belong to the same 


1 Since they both occur initially before various vowels. 
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phoneme, the rule governing their use being that i is used only in 
open syllables and a only in closed syllables. Examples of i are 
di ['] (to come), itie [. . •] (a seat); examples of a are efak [..] (the 
Efik language), dap [' ] (to hide), jiamme [' ‘ ] (to agree), tabbi [''] 
(to dig). 2 The latter words are rightly written with i in the new 
phonetic spelling of Efik (Efik, dip, nyimme, tibbi). In other 
languages, such as English, German, Welsh, Sinhalese, i and a 
sounds belong to separate phonemes. 

120. s and z belong to separate phonemes in most languages 
in which they occur, e.g. in English, French, German, Roman 
Italian (see §121), Russian, Urdu, Japanese, Zulu. But in Spanish 
they belong to the same phoneme, since in that language z occurs 
to the exclusion of s before voiced consonants and not in any other 
situation; examples are 'mismo, rri'esgo (narrow transcription 
['mizmo], [rri'ezyo]). 

121. It would seem that types of Italian exist in which s and z 
belong to the same phoneme, the rules for the use of these sounds 
being that (1) s occurs initially and before breathed consonants, 
(2) doubled s occurs intervocalically, (3) z occurs before voiced 
consonants, (4) single z occurs intervocalically to the exclusion of s. 

122. The common distribution of g and non-syllabic g 3 in 
Japanese suggests that these two sounds might be held to belong to 
a single phoneme in that language; g occurring solely as a word 
initial (or in an initial-like position) and g elsewhere, as exemplified 
by gai (harm), gakko: (school), but [kugi] (nail) [saggai] 4 (third 
storey [second g]). It seems, however, that there are a few excep¬ 
tions which cannot be explained by any rules, and which therefore 
render a single phoneme grouping impracticable. 8 

123. t and d belong to separate phonemes in most languages 
in which they occur. But it seems that in a common form of 
Greek they have to be assigned to the same phoneme, since in this 

2 All the other vowels of the language (e, a, D, O, U) can occur both in open 
and in closed syllables. These particulars were furnished to me by Professor 
Ida Ward. 

3 For syllabic g see §§292-295. 

4 Written in ROmazi kugi, sangai. 

5 E.g. that go (five) retains the sound g when occurring as the second ele¬ 
ment of a compound, and that some recently borrowed foreign words are said 
with a medial g. (A summary of the chief exceptions is given in S. E. Martin’s 
Morphophonemics of Standard Colloquial Japanese, p. 22.) 
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form of the language d occurs to the exclusion of t after n, but not 
elsewhere. Thus, in this form of Greek, ivSeKot 6 (eleven), SsvTpo 7 
(tree) are pronounced ['endeka], ['Sendro]. (Broad transcriptions 
of these words would be 'enteka, ‘Sentro.) Similar considerations 
apply to the Greek p and k phonemes. 8 9 

124. d and 8 belong to separate phonemes in English, as is 
shown by the existence of the words den [den), 8en [then). Other 
varieties of d and 8 belong to separate phonemes in Danish; this is 
shown by such a pair as 'reda [rette, comply), and 'rsSs [redde, save). 
But analogous sounds belong to a single phoneme in Spanish, where 
a plosive d is only found after n and l 8 ; the case is exemplified by such 
words as ['to8o] (whole), [sole'SaS] (solitude), [me8'rar] (to prosper), 
but ['kwando] (when), [in'dulto] (pardon), [es'palda] (shoulder). 

125. In most languages p and f occur as separate phonemes. 
A. N. Tucker has, however, found that in certain languages of 
the Sudan they belong to the same phoneme, p being used exclusively 
before front vowels and f exclusively before back vowels. Similarly 
with b and v. 10 

126. Sounds of the I and r types belong to separate phonemes 
in European languages, but they belong to a single phoneme in 
some languages of Africa, as mentioned in §§82, 83. 

127. It was stated in §81 that the sounds I and J occur in 
the Tswana language and belong to the same phoneme. These 
two sounds also exist in the cognate language Pedi, but there they 
belong to separate phonemes, since the syllables li and lu occur 
as well as the syllables Ji and Ju. (See A. N. Tucker, Comparative 
Phonetics of the Suto-Chwana Group of Bantu Languages, §146.) 


6 Also (less commonly) spelt EVTSKCC. 

1 Also (less commonly) spelt SsvSpO. 

8 This does not by any means apply to the pronunciation of all Greeks. 
Some use d, instead of the more natural nd, initially in words of modern 
foreign origin. They will pronounce VTOl/S (shower-bath), a Greek rendering 
of the French douche, as dus instead of [ndus]. With speakers who do this 
consistently, d constitutes a phoneme separate from t. It appears, however, 
that the pronunciation of most Greeks is erratic (Chap. XXVIII), and that 
it is in consequence impossible to come to any definite conclusion as to the 
phonemic status of their sounds b, d and g. 

9 In some styles of speech also initially. 

10 Tucker, Eastern Sudanis Languages, Vol. I, §780. 
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128. In Panjabi and Marathi the sounds n and q occur in 
similar phonetic contexts, e.g. they may both precede any vowel. 
The two sounds therefore belong to separate phonemes in those 
languages. But in many other Indian languages the same sounds 
belong to a single phoneme, since in them q occurs exclusively 
before { and (§84). 

129. It sometimes happens that two sounds belong to one 
phoneme in one dialect of a language and to two separate phonemes 
in another dialect of the same language. A notable case is that of 
English "short” vowels. It has already been pointed out in §104 
that in one type of Southern English the difference of quality 
between the vowels in deed and did is accompanied by a difference 
of length. Similarly with the vowels in cart and cat, cord and cod, 
food and good. One may consequently assign each pair of sounds 
to a single phoneme, with the convention that the difference of 
quality accompanies the difference of length. 

130. There are, however, Southern English speakers who 
lengthen the vowels in bid, cat, cod, good, while preserving the 
usual quality. With them these vowel-sounds constitute phonemes 
separate from those in bead, cart, cord, food. (See my Outline of 
English Phonetics, §§874-879.) 

131. It would seem that types of French exist in which the 
front a and the back a are connected by a length relation, a being 
always short and a always long; in other words, that there exist 
French people who use long a: in such words as page, lave, soir 
as well as in pate, atne, croitre, etc., and who use a short front a 
in such words as pas, mat, droite, as well as in Id, patte, boite, etc. 
In such a type of French the two sounds would be assignable to a 
single phoneme. In a common type of Parisian French both 
sounds occur both long and short in similar situations; thus one 
meets with Parisian speakers who use short a in Id, boite, long a: 
in page, poivre, short a in pas, droite, and long a: in pate, croitre. 
In this type of French a and a necessarily constitute separate 
phonemes. (See also §621.) 

132. In some types of German the vowels e and £ constitute 
separate phonemes, while in others they are to be grouped together 
as members of a single phoneme. (See footnote 19 to §105, also 
§§224-228.) 
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CHAPTER IX 

ACOUSTIC IMPRESSIONS 

133. People often get impressions that speech-sounds are other 
than what they are. There are two chief circumstances in which 
sounds are thus “mis-heard” as other sounds. The first is the 
common case where a person fails to estimate correctly the quality 
of a sound of a foreign language or of a dialect, and identifies it 
with a sound of his own mother tongue. When, for instance, a 
Frenchman hears the Southern English sound a:, he is liable to 
think it the same as his French oe or the French group oer; when 
a German hears the Southern English short a, his first reaction is 
to identify it with the German e, incidentally confusing the English 
short e and a sounds. So also, as German does not contain the 
sound of English w, Germans "hear” it as their variety of v. 
Similarly, I am informed that speaker^ of Spanish, having no q 
in their language, mistake the q of the Quechua language of Peru 
for the Spanish x, thereby confusing the two Quechua sounds q 
and x. Instances of this kind can be multiplied indefinitely. 

134. The second case is where the various members of a phoneme 
are not recognized as being different from each other. It does not 
as a rule occur to an Englishman, for instance, that the k-sounds 
he uses in keep, call and cool differ in any way, nor that, in the 
pronunciation of many, the vowel-sound in well is different from 
that in wet. It does not occur to a Dane, unless his attention has 
been specially called to the fact, that the sound of a in the Danish 
word sang differs from that in sand. And so on. 

135. Ability to make accurate estimates of sound qualities can, 
as is well known, be developed by phonetic training. But the 
linguistic investigator always has to be on his guard against the 
possibility of mis-hearing sounds in the two types of case above- 
mentioned. 

136. It is possible too that a particular speech-sound may 
sometimes give the impression of being two different sounds in 
different phonetic contexts, in the same way as a given colour 
may appear to vary on account of the proximity of other colours. 
Optical illusions in regard to colour doubtless have their counterpart 
in aural illusions as to qualities of sound. This, however, is a subject 
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which has not yet been investigated as far as I am aware. If it 
were established that any considerable aural illusions as to the 
quality of sounds are brought about by the proximity of other 
sounds, the facts would presumably have some bearing on the 
phoneme theory. At present, however, it would seem that such 
aural illusions as may exist are insufficient to invalidate any of the 
theory presented in this book. 

137. It is the fact that people are not as a rule aware of the 
existence of differing members of phonemes that has led to the 
common confusion between speech-sounds and phonemes. The 
view of the phoneme propounded in Chapter II of this book is 
that it is a family of speech-sounds. It is only by looking at the 
phoneme in a different light that it can be considered in any sense 
as a sound. It might possibly be regarded as a sound aimed at by the 
speaker, but there is some doubt as to whether a consistent theory 
of phonemes can be established on this basis (see Chap. XXIX). 

138. Though differences between the members of a phoneme 
are not readily perceived by native speakers of the language in 
which the phoneme occurs, yet they may be noticeable to a foreign 
hearer if they run counter to a usage in regard to similar sounds 
existing in his language. This is especially the case if the first 
language contains in one phoneme two sounds which belong to 
separate phonemes in the second language. This was brought 
home to me in a striking way once when a speaker of Urmian 
Syriac observed to me that the t’s in ten and letter sounded to him 
quite different; it afterwards transpired that in his native tongue 
aspirated and unaspirated t occur as separate phonemes, so that 
the (to me) slight difference in the amount of aspiration between 
the English stressed and unstressed t was very apparent to him. 1 

139. For similar reasons English people are very conscious of the 
difference between the vowels in the Arabic words batt 2 (he decided) 
and Jatt 3 (shore). This difference of vowel is immaterial to an Arabic 
speaker; to him the only essential difference is in the consonants. 

1 This was in 1912. My informant was a remarkable linguist and phonetician. 
S. Osipoff, whose mother tongue was Syriac, as spoken by about 10,000 
people then living in the neighbourhood of Lake Urmia. (Since then these 
people have had to migrate to Iraq and Syria, but they continue to speak 
the Syriac language.) 

2 A raised front a. 

8 A back sound of the a-type. 
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140. The fact that speakers of one language often notice 
differences which are immaterial to speakers of another language 
has sometimes been responsible for the introduction of unnecessary 
complications in spelling. Thus/, ts, ch, dz,j and sh were introduced 
into the old (Hepburnian) Roman writing of Japanese, because 
that system was inspired by English people who naturally identified 
cert ain Japanese sounds with English ones; the English find it difficul t 
to realize that these sounds belong to the h, t, t, z (or d), z (or d), 
and s phonemes respectively, and are, therefore, from the Japanese 
point of view, adequately provided for by the letters h, t, t, z (or d), 
z (or d) and s. (See §§323-331.) 

141. Th was introduced into the orthography hitherto used for 
Kikuyu for a similar reason. The originators of this orthography 
were English. Being accustomed to hear d and 8 as separate 
phonemes in English, they naturally wrote these sounds with d and 
th in accordance with English orthographical habits, wherever 
they heard them, not realizing that the distinction is immaterial 
in ordinary Kikuyu (see §79). The Kikuyu language contains the 
pairs b (3 and g y used in the same way as d 8, but since (3 and y 
are not English sounds there was no temptation to make similar 
provision for them in the orthography. 4 

142. The converse of the case referred to in the foregoing 
paragraphs is also very commonly met with. If two sounds are 
phonemically distinct in one language but not in another, speakers 
of the latter language are liable to confuse the sounds. Thus, 
English people learning Indian languages continually confuse the 
Indian dental t and retroflex [, because sounds of this character 
do not occur as members of separate phonemes in English; the 
English learner identifies them both with "the English t.” 5 6 

4 If Kikuyu orthography had been invented by Germans, it might well 

have happened that b and |3 would have been distinguished by the letters b 
and w, while the distinction d a would have been ignored. If it had been 
invented by Spaniards, only b, d and g would have been used to denote the 
plosive-fricative phonemes, since the Kikuyu use of p, 8, y is similar to the 
Spanish; the writing would then have been the simplest possible and in 
accord with the needs of the language. (See §79.) 

6 "The English t” is a phoneme comprising a good many sounds. The 
principal member is an alveolar t intermediate between the ordinary Indian 
t and ( (but nearer in sound to {)• (See §72, also my Outline of English 
Phonetics, third and subsequent editions, §§ 511-513.) 
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143. For a similar reason English people have difficulty in 
distinguishing between the German sounds J and 9 , despite the fact 
that a variety of 9 is used by some English speakers in such words 
as huge. They likewise find it difficult to distinguish the aspirated 
and unaspirated consonants of many languages, though degrees of 
aspiration are to be found in English. Spaniards do not distinguish 
the English d and 3 properly, though both sounds occur in their 
language. The Japanese fail to distinguish between such pairs of 
words as foot and hoot, though their language contains both h and 
a variety of f. 


CHAPTER X 

ASCERTAINING THE PHONEMES OF A LANGUAGE 

144. The fact that phonemes are the chief linguistic elements 
differentiating one word from another (§§50-53) generally furnishes 
us with the quickest and surest method of ascertaining the phonemes 
of languages. When an investigator desires to find out whether 
two sounds of a language belong to separate phonemes or not, he 
should try to find a pair of words distinguished solely by means of 
these two sounds. The discovery of such a pair of words is con¬ 
clusive proof that the sounds belong to separate phonemes. 

145. Thus the existence of the words bit, bet and bat in English 
suffices to show that the vowel-sounds in these words belong to 
separate phonemes; the existence of kat (cat), kot (cot) and kAt (cut) 
show that a and short o belong to phonemes distinct from these. 
The existence of the words ne (nez) and ns (nail) demonstrates 
that e and £ are separate phonemes in French. Lingual r and 
uvular r are shown to occur as separate phonemes in Provencal, 
as spoken at Arles, by pairs of words like 'ssro (evening), 'sero 
(saw, n.), ga'ri (to cure), ga'Ri (oak ). 1 And so on. 

146. The phonemic status of a sound may also be established 
by discovering whether it can be added to or subtracted from a 
sequence. Thus the existence of the English words ad, sad, sand, 
stand, shows that there are s, n and t phonemes in the language. 

1 Examples taken from H. N. Coustenoble’s La Phonitique du Provencal 
Moderne, p. 93. 
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147. If the observer knows that two sounds occur in a language 
but is unable to find a pair of words distinguished solely by an 
exchange of these sounds, he may still be able to prove that they 
belong to separate phonemes by finding two words containing the 
sounds in situations of sufficient similarity. The sounds belong to 
separate phonemes if he can show that their use is in no way depen¬ 
dent upon other differences that the words may contain. 

148. Thus I clearly feel a and a to belong to two separate 
phonemes in my type of English, 2 but I am unable to find proof 
by a pair of words differing solely by an exchange of one of these 
sounds for the other. The difficulty of proof is due to the fact 
that a rarely occurs with weak stress, while short a is always weakly 
stressed. However, the fact that my speech contains such words as 
'hikAp {hiccup), 'hAmdrAm {humdrum), 'katapAlt ( catapult ) as well 
as words like 'sirap {syrup), ka'nAndram ( conundrum), ‘difikalt 
{difficult) demonstrates the case to my satisfaction, since it cannot 
be maintained that the use of a and a in these weakly stressed 
positions is attributable in any way to the nature of the preceding 
sounds in the words. 3 (For another theory concerning these sounds 
see §§197 ff.) 

149. There is a similar case in German, 4 where there does not 
appear to be any pair of words differentiated solely by the use of 
the sounds £ and a. But the existence of the word 'edcnt {elend) 
beside [’feilant] ( fehlend) and numerous other words ending in 
[-ant] suffices to show that the two sounds belong to separate 
phonemes. (See further, §§224-229.) 

150. It is evident too that German has two separate phonemes 
h and g, though in the speech of many Germans there does not 
appear to be any pair of words distinguished by an interchange of 
these sounds. The fact that both sounds occur initially before 
various vowels suffices to show that there are two phonemes. 5 

2 One type of Southern British. 

3 With many Southern English speakers the case is proved by the existence 
of a few words containing strongly stressed short 3, and notably the words 
pronounced by them bi'kaz and djSSt (adverb), the latter word being distinct 
from the adjective d3ASt. (This does not apply to my pronunciation. I 
pronounce because as bi'koz, and the adverb just like the adjective, djASt.) 

1 Stage pronunciation as shown in Vietor’s Deutsches Ausspracheworterbuch. 

5 A pair is found in the speech of those who pronounce Chords as 'gorde 
(instead of the more usual 'korde): 'gorde, 'horde (Horde). 
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151. The sounds o and d of normal Parisian French probably 
exemplify the same point. In that type of French there does not 
appear to exist any pair of words distinguished solely by these 
sounds, but the words so :3 ( sauge ) and 10:3 (loge) suffice to demon¬ 
strate that the two vowels constitute separate phonemes . 6 

152. Sometimes when a pair of words cannot be found to 
demonstrate that two sounds belong to separate phonemes, proof 
can be given by means of a series of four words. Thus, as already 
mentioned in §53,1 do not think there is any pair of English words 
differing solely by the interchange of g and 3 , and which will therefore 
prove these sounds to belong to separate phonemes; but, in the 
words of Trager and Bloch , 7 “the lack of such a pair is easily 
supplied by the series singer : sitter, letter : leisure or ring : rim, 
room : rouge.” 

153. One cannot have a positive proof that two sounds must 
belong to one phoneme. They may, however, be presumed to 
be assignable to one phoneme if exhaustive tests fail to reveal 
any case of their occurrence in similar phonetic contexts. As 
long as it is felt that there is a possibility of the sounds occurring 
in the same phonetic context, they must be presumed to belong to 
separate phonemes. 

154. An examination of the examples in Chapter VII shows 
that in the great majority of cases there is no doubt at all as to 
what phoneme a sound belongs to. In most cases it would not 
occur to the average native to group the sounds otherwise than in 
the manner described in that chapter. He knows the grouping 
instinctively, and doubt only arises if different usages of other 
languages are brought to his notice. 

155. Cases are, however, found from time to time where the 
phonemic grouping of sounds is by no means clear, and can only 
be settled after prolonged investigation. Cases even appear to 
exist where it is not possible to make a satisfactory grouping at 
all, or where the grouping has to be arbitrary. Instances are given 
in Chapter XIV. 

6 There are other types of French in which conclusive examples are numerous. 
In Swiss-French, for example, peau and pot are distinguished as po and pD. 

7 The Syllabic Phonemes of English in Language, Vol. XVII, No. 3, July, 
1941, p. 229, footnote 10. 
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CHAPTER XI 

NEED FOR ACCURATE ANALYSIS 

156. It goes without saying that no phonemic classification 
can be made unless the sounds concerned have been analysed 
with great accuracy and all the conditions concerning their use 
have been fully established. A superficially made analysis may 
lead to erroneous classification. 

157. As an example of this we may take the nasal consonants 
of modern Provencal, as spoken at Arles, particulars of which 
have been supplied to me by Dr. H. Coustenoble, and are set out 
fully in her book, La Phonetique du Provencal Moderne. That 
language contains the sounds m, n and g. m and n occur before 
vowels but never at the end of isolated words; g never precedes 
a vowel in a word, but it occurs at the ends of words; it also occurs 
before k and g. Examples of these sounds are 'maire (mother), neu 
(snow), tsig (dog), 'blagko (white), veg'g® (come, p.p.). 

158. This distribution might lead one to think at first sight 
that g must belong to the same phoneme as n. On further examina¬ 
tion, however, it is seen that this cannot well be the case. For we 
find that this language is peculiar in containing some words which 
when said in isolation terminate in g, but which are pronounced 
with n when a word beginning with a vowel follows. In the case 
of some words this substitution is always made, in the case of others 
it is sometimes made and sometimes not, according to the nature 
of the expression. Thus: (1) sug (are) is always pronounced with 
g (or before certain consonants mg or nrj, see §160), e.g. sug ami 
(they are friends), sug aid'ba (they have arrived), sug eg tRig (they 
are in process (of)), but sug (his) is always replaced by the form 
sun when the following word begins with a vowel, e.g. sun us'tau 
(his house), sun 'ugkle (his uncle), sun eg'tRig (his animation), 
(2) beg (well, very) is generally pronounced with g before vowels, 
but it is pronounced ben in various common expressions, e.g. 
beg a'vag (a long time before), beg au (very high), but ben 0 'rus 
(very happy), ben o'tiile (very useful). 

159. The above examples show that n and g must be assigned 
to separate phonemes, unless it could be found possible to consider 
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the forms with n as prefixes forming single "words" with what 
follows. It does not, however, seem feasible to consider them so, 
since the principles, such as they are, which determine whether 
n or g is used cannot be specified with precision. These principles 
are moreover syntactic or semantic, and not phonetic. 1 

160. Two other nasal consonant sounds are found in Provensal, 
one of which is used when p or b follows, and the other when t 
or d follows. To an unpractised ear they sound at first like m 
and n respectively, and one might easily be tempted to assign 
them to the m and n phonemes. By listening carefully, however, 
one can hear that the sounds are not ordinary m and n at all, 
but m and n pronounced with a simultaneous g. Using the notation 
mg, ng to represent these sounds, examples can be written in narrow 
transcription thus: [emgple'gaido] {employee), ['kangte] (I sing). 
The sound mg is of course "related in character” to m, but there 
is nothing in the nature of a "similitude” 2 to justify assigning to 
the m-phoneme the sound mi) occurring before p and b. The 
use of a labialized g before p and b is, however, a clear case of 
similitude involving the assignment of mg to the g-phoneme. 
Similarly with ng. Proven 5 al mg and ng must therefore be con¬ 
sidered as members of the g-phoneme, and should be written 
simply with g in broad phonetic notation. The above cases thus 
illustrate how a superficial analysis, based on insufficient material 
or incorrect identification of sounds, might cause two serious 
errors in phonemic classification, namely, to assign g to the n- 
phoneme and to assign nig and ng to the m and n phonemes. 

161. The need for accurate analysis may be exemplified by 
two further instances of errors of classification which have been 
made in the past. The first concerns the Hindi vowel sound 
hitherto generally written by romanizers with short a when h 
and another consonant follows, as in the older transcriptions 
pahla (first), sha.hr (city), kahnd (to speak). The sound in such 
words is, at any rate in the pronunciation of Lahore, rather E-like 
in character and resembles the Southern English vowel of bag. 


1 n is used when there is very close grammatical connexion between the 
words, or when the expressions are very common. It is probably impossible 
to define the required degrees of “closeness” and "commonness." 

* See my Outline of English Phonetics, §835. 
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These transcriptions imply that this e-like sound is a member of 
the phoneme whose principal member is the A-like sound heard 
in the words written in older transcriptions das (ten), phal (fruit), 
garni (warm). Recent research has, however, shown that the above 
e-like sound has nothing to do with the A-phoneme, but is identical 
with or similar to the slightly diphthongal sound which the older 
transcribers wrote at. It has also been established that there is 
not really a separate h following the vowel in pahla, etc., but that 
the vowel is said with "breathy voice” which continues throughout 
its whole duration; the sound formerly written ah before a consonant 
is in fact an “aitchified” vowel, to use a term suggested by J. R. 
Firth, who was the discoverer of the correct phonemic classification 
of the sounds under consideration. (It is convenient to represent 
aitchified vowels in practical writing by digraphs® consisting of 
vowel letters followed by h.) 

162. In the proposed new All-India Romanic Orthography 
the correct phonemic classification is followed. The A-phoneme 
is written, very suitably, with a, so that the old transcriptions 
das, phal, garni are replaced by das, phal, garni. The old at is 
written ay, the old "Roman Urdu” forms hat (is), baithna (to sit), 
etc., being replaced by hay, bay[hna, etc. And words formerly 
written with ah + consonant are in the proposed new orthography 
properly written with ayh, in accordance with the correct phonemic 
classification of the vowel, thus payhla, fayhr, kayhna. 

163. Similar considerations apply to the Hindi sound written 
awh in All-India Romanic, e.g. in bawht (much). In the older 
romanic transcriptions this word was written bahut, a form 
indicating a misconception of the phonemic make-up of the word. 
The vowel really is the aitchified form of the slightly diphthongal 
sound so heard, for instance, in awr (and) which the older romanizers 
wrote aurf 

164. My last example is taken from the Tswana language. 
It is an instance where an incorrect phonemic analysis is reflected 
in current orthography, though fortunately without any untoward 
results. Tswana contains an ordinary J which is pronounced with 


s Or, in the case of diphthongs, trigraphs. 

4 For particulars of the proposed All-India Romanic Orthography, see my 
paper on The Problem of a National Script for India, which incorporates 
Firth’s discoveries. 
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spread lips; it also contains a strongly labialized J which may be 
denoted by the symbol \. 5 Both these sounds occur before the 
vowels i, i, e, e and a, and in consequence they constitute separate 
phonemes. Examples illustrating their phonemic use are Ja (die), 
\a (bum), stjeje (flower), ^eu (white). Before the remaining vowels 
d, o, o and u, only \ occurs; the non-labialized J is never used before 
these vowels by the native speakers I have heard. Examples are 
\5ka (twist), “jotla (harass, mock), "juta (move). 

165. Some observers have treated \ as if it were not a separate 
phoneme. Before i, i, e, e and a they have taken it for a sequence 
of J and w, and before o, o, o and u they have identified it with J. 
This incorrect analysis is no doubt the origin of old spellings 
such as sha, shwa, sesheshe, shweu, shdka, shotla, shuta. Phonetic 
consistency would require writing skwdka, shwotla, shwuta. 

166. It so happens, however* that the incorrect analysis leads 
in this case to the same style of orthography as that derived from 
the correct analysis. The orthographic forms given above, or their 
more phonetic equivalents fa, fwa, sifefe, fweu, foka, fotla, futa, 
are justifiable on grounds of practical convenience, though not on 
grounds of phonetic exactitude. It is convenient in Tswana to 
use w as a general symbol denoting labialization, and to use 
it in the representation of various labialized consonants, fw can 
thus be used as a digraph denoting Then the w may be 
omitted by convention before o, o, a and u, to economize letters in 
the spelling, since no ambiguity is caused by doing this. It must be 
remembered, however, that though the result might be a convenient 
orthography on phonetic basis, 6 it is not an accurate phonetic 
transcription. For teaching a good pronunciation of the language 
an accurate phonetic transcription is required; this should be on 
phonemic basis and should consequently make use of either a 
special symbol for \ or the digraph Jw, which should be employed 
before o, o, o and u as well as in such words as Jwa, Jweu. - 


6 The labialization is of an unusual type. The lips are rounded, and the 
lower lip is so held that the upper teeth touch it at a place about three-quarters 
of an inch behind its edge. This kind of labialization gives a peculiar acoustic 
effect. 

6 In the new orthography proposed by A. N. Tucker the vowel represen¬ 
tation would be different from that indicated here. See his article Sotho- 
Nguni Orthography referred to in §482, footnote 18. 
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CHAPTER XII 

DIFFICULTIES IN IDENTIFYING SOUNDS IN 
SPECIAL SEQUENCES 

167. In the preceding chapter emphasis was laid on the need 
for the accurate analysis of sounds as an essential preliminary to 
attempting any phonemic classification. Mention must now be 
made of the fact that sequences are occasionally such as to render 
a sound difficult to identify, with the result that there may be 
some uncertainty as to the appropriate phonemic grouping. 

168. Such cases may be classified under two heads: (1) where 
two sounds, though acoustically rather similar, are recognized as 
distinct by the speaker and therefore may have “distinctive” 
function, (2) where the sounds have acoustic similarity but are so 
placed that they do not have "distinctive” function. 

169. The English sounds s and voiceless z (z) furnish an illustra¬ 
tion of these cases. 1 There are, as is well known, certain phonetic 
situations in which both of these sounds can occur. They can, 
for instance, both occur initially, as in si:l {seal) and zi:l ( zeal ) 2 ; 
an English speaker can sound either of these syllables at will, and 
an English hearer will usually identify the consonants correctly 
in spite of their resemblance to one another. 3 For this reason the 
sounds s and z belong to separate phonemes. 

170. In some sequences, however, it is difficult to hear which 
sound is employed. Instances of the first type (§168) are found 
in such words as obstruct, abstinence, Beaconsfield. It is sometimes 
nearly impossible for a hearer to tell objectively whether a speaker 


1 These sounds differ from each other solely in the degree of breath force, 
S having strong force and z being a similar sound uttered with weak force. 

2 The reference here is to the pronunciation of those numerous English 
speakers who do not use voiced z initially. 

3 In final position after a long vowel the identification is facilitated by the 
difference of vowel-length which accompanies these consonants. Thus the 
Ui in the sequence -u:s is easily heard to be much shorter than that in -u:j. 
(See §§188-190.) 
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is pronouncing ab'strAkt, 'abstinans, 'bekansfi:ld or abz'trAkt, 
'abztinans, 'bekan?fi:ld; the speaker, however, knows by his sub¬ 
jective feeling, which pronunciation he is using. 4 English people 
who habitually pronounce these words in one of these ways are 
able, if they so wish, to pronounce them in the other way. The 
sequences bst, bzt, nsf, nzf are all familiar to him, though possibly 
he may use bzt at word-junctions only; the "distinctive” function 
of the sounds is present at any rate in theory. 5 

171. An instance of the second type (§168) is afforded by the 
terminal consonant of such words as puts, drinks, box. The pro¬ 
nunciation usually transcribed is puts, drigks, boks, and the 
pronunciation of these words with a normal s is usual. The case 
is, however, one where the degree of force of the s is immaterial, so 
that weak varieties of s are frequently used. In particular the 
use of ? (which = a weak s) is not by any means uncommon: 
put?, drirjkz, bokz. In this situation the distinction between s 
and z is not “significant.” 6 

172. The question therefore arises whether the ? in this pro¬ 
nunciation of these words should be regarded as a member of the 
z-phoneme, as it is in other connexions, or whether it should 
be assigned to the s-phoneme in this particular case. In other 
terms, is the z in these words to be regarded functionally as a z, or 
are we to take account of the fact that organically it is merely a 
weak s? 

173. It appears to me that this z, like other z’s in English, 
should be regarded as a member of the z-phoneme. I feel this 


* Speakers can doubtless be found who use voiced Z in these words. With 
them the identification of the sound would present no difficulty to the hearer. 

* It is doubtful if there exists any pair of English words distinguished 
solely by bst and b?t. If there were such words, hearers would undoubtedly 
have difficulty in identifying them. Even at word-junctions differences of 
this sort are difficult for the hearer to perceive: Bob struck ('bob ’stTAk) 
and Bob's truck ('bob? trAk) are very much alike to the hearer, as also are a 
a good strain (a 'gud Strein, said with contrast emphasis on good) and a goods 
train (a 'gud? trein). nsf and n?f occur only at word-junctions or in com¬ 
pound words such as 'hens'fo:0 ( henceforth ) and one pronunciation of 
Beaconsfield. 

6 Voiced Z is not used by any English speaker in such a position. This 
fact is, however, not relevant to the present discussion. 
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to be necessary on the general ground that the assignment of a 
sound to two phonemes in the same language is inadmissible on 
principle, and that such a plan should only be resorted to if , a case 
should arise where no other course is open (see §35, and Chaps. 
XIX and XX). This recommendation involves writing putz, 
drirjkz, bokz, etc., in a broad transcription of the speech of those 
who pronounce putz, drirjkz, bok?, etc. 

174. It is worth noting that in my experience English students 
of phonetics not unfrequently write forms like putz, drirjkz, in 
their transcription exercises. In the past I have corrected these 
as errors, but I now think that such transcriptions may not always 
be erroneous. Often they are errors no doubt, but sometimes it 
may happen that a student uses the sound z in such words because 
he subjectively feels the sound to be a z rather than a s. This 
experience may afford some additional support to the view expressed 
in §173. 

175. It should be noted further that many English speakers 
have erratic pronunciation of words like puts, pronouncing sometimes 
puts and sometimes putz for no assignable reason. It is shown in 
Chapter XXVIII that erratic pronunciation cannot be reduced 
to phonemes. Phonemic classification of the final consonant of 
puts can only be made if the speech recorded is consistent. In the 
speech of a person who consistently uses a s-sound, that sound 
belongs to the s-phoneme even if it is a somewhat weakened form 
of the consonant; in the speech of one who definitely uses z, and 
uses it consistently, the sound should, it seems to me, be assigned 
to the z-phoneme. 

176. The following is an example from French of sounds which 
are difficult to identify in special sequences. In that language 3 
occurs as a member of the 3 -phoneme; it is distinct from J, and 
the two sounds may occur in similar situations. Thus ne bouge pas 
is commonly pronounced na bu 3 pa, and this is distinct from 
ne bouche pas (na buj pa). When, however, $ occurs initially 
before a voiceless consonant, it is more difficult to distinguish 
from J, and in fact J is often substituted for it. Thus je te crois 
is pronounced J ta krwa as well as 3 ta krwa. Both pronunciations 
exist, and it is difficult to hear the difference between the sounds 
in this position. It will, I think, be generally agreed that, as in 
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the English case cited in §§170—173, when 3 is used it should be 
assigned to the 3 -phoneme. 

177. The 3 -like sound used by many English speakers in such 
expressions as please yourself, Mrs. Young, is sometimes difficult 
to identify. To determine its phonemic status it is necessary to 
make certain whether the speaker under observation uses a sound 
intermediate between z and 3 ('pli: 3 z joi'self, misi 3 z 1 j aq) or whether 
he uses a real 3 as in pres'tig (prestige ). The use of a real 3 identical 
with that in pres'tig means that in such an expression as please 
yourself the speaker substitutes the 3 -phoneme for the z-phoneme 
ordinarily used in the word please. I should regard it as inadmissible 
to assign such a 3 to the z-phoneme (see footnote 1 to §315). When, 
however, the sound in please yourself is near to 3 but not identical 
with it, it cannot in my opinion be regarded as other than a sub¬ 
sidiary member of the z-phoneme. 

178. Similar considerations apply to the terminal consonant of 
dress in dress-shirt. With many the pronunciation is 'drej'jait, the 
J being an ordinary one exactly like that at the end of fij in 'fijjop 
( fish-shop). The pronunciation 'drej'jait is the result of a con¬ 
textual assimilation by which the speaker substitutes his J- 
phoneme for his s-phoneme. Others pronounce the first syllable 
of dress-shirt with a sound which is not definitely J, but is a J-like 
variety of s. Their speech exemplifies a similitude involving the 
use of a particular (J-like) member of the s-phoneme. This 
pronunciation would be transcribed broadly as 'dres'Jsit . 7 

179. Instances of difficulty in identifying sounds are also to 
be found in Russian. In certain sequences it is difficult to hear 
whether a consonant is a "soft” (palatalized) or a “hard” (non- 
palatalized) one. This is the case, for instance, with the first 
sound of the groups stj, znj, slj, zlj when initial, as when the words 
stjix (cthx, verse), 'zdjejnjij (3flenranft, local), 'sljiva (cjiHBa, plum) 
are said in isolation. On examination it is found that they are 
soft 8 ; the best proof is furnished by the sounds of vowels immediately 
preceding these words in connected speech, for which see §183. 


7 For the difference between assimilation and similitude, see my Outline 
of English Phonetics, Chapter XXVI. 

8 The degree of softness is not the same in every case. For instance, the 
S is softer in Stj than in snj. 
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CHAPTER XIII 

IDENTIFICATION OF SOUNDS BY CHARACTERISTICS 
OF ADJOINING SOUNDS 

(1) Recognition of Sounds through Subsidiary Members 
of an Adjacent Phoneme 

180. It occasionally happens that the difference between two 
groups is made evident not so much by the sounds of the distin¬ 
guishing phonemes as by special members of an adjacent phoneme. 
This occurs when the two distinguishing phonemes have members 
resembling each other and the adjacent phoneme has clearly distinct 
members which have to be employed in juxtaposition to those 
phonemes. 

181. The case is illustrated by the Danish groups which would 
be written broadly -an’ and -ag’, to which attention has already 
been called in §92. In Danish, when a nasal consonant is in final 
position and pronounced with simultaneous glottal stop, a devoiced, 
or almost completely devoiced, variety is used. In the case of 
n and g the resulting sounds n and rj are acoustically very near to 
each other. Now, as we have seen, when the a-phoneme precedes 
the n-phoneme in Danish a front a is used, but when it precedes 
the g-phoneme the vowel-sound is a back a. The above sequences 
could therefore be transcribed narrowly as [-ag’] and [-ag’]. It 
will be readily appreciated that these sequences are recognized 
much more by the vowel qualities than by the sounds of the 
consonants. 

182. One cannot cite this case as a ground for assigning a and a 
to separate phonemes in Danish. It is clear from other considera¬ 
tions that the g and g must be regarded as the primary distinguishing 
sounds of such words as [sag’] and [sarj’]. In the first place the 
ordinary voiced n and g are much commoner in Danish than the 
voiceless sounds used in the above special case, and instances such 
as 'onar ['oner, evils) and 'ogar (unger, young ones) show that 
these voiced sounds must be held to belong to separate phonemes. 
Moreover, though the sounds n and g require that special types of 
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a-sound should precede them, yet no analogous differences are 
found when other vowels precede. 1 It must be noted too that 
although there exist some varieties of q in Danish, there is always 
a definite line of demarcation between the q and n sounds. This 
cannot be said of the Danish a-sounds. Several varieties of these 
are used according to circumstances, and there is no line of demarca¬ 
tion between front varieties and back ones; the various shades 
merge imperceptibly into one another. 

183. Certain of the Russian soft (palatalized) consonants are 
always quite distinct in quality from the corresponding hard (non- 
palatalized) sounds. This is the case more particularly with lj 2 
and I. But there are some pairs, and notably nj 2 and n, sj and s, 
zj and z, which are heard as quite distinct sounds in some phonetic 
contexts but are much more alike in others. Thus the nj in 
'njanja (hhhh, nurse), for instance, is a very different sound from 
the n of 'nada (Haao, it is necessary). But when a consonant 
follows, and when therefore the sounds are deprived of their off- 
glides, the difference between them is by no means so clear; it 
would indeed be difficult to distinguish such sequences as njk and 
nk, were it not for the fact that vowels preceding soft consonants 
always have different shades of quality from vowels preceding 
hard consonants. For instance, the sequences in the words 'vanjka 
(BaHBKa, diminutive of Ivan) and 'banka (fiaHKa, jar) are really 
recognized more by the vowels than by the qualities of the nasal 
consonants. 3 (The first a in 'vanjka is a fronter variety than that 
in 'banka.) Similarly, the softness of the s in 'pjesnja (song) and 
the hardness of the m in 'zjem-lji (lands) is heard chiefly by the 
closeness of the e in the first word and the comparative openness 
of the e in the syllable zjem-; narrow monographic transcriptions 


1 Thus the vowels in 'eqa (Inge, proper name), 'leqs ( Itznge, long) and 
'oqar are not appreciably different from those in 'ena (inde, in), 'sn3 (ende, 
end) and 'onar respectively. 

2 See footnote 15 to §95. 

3 The glides from the vowels to the consonants also help to show which 
type of consonant is used. There is always an i-glide between a vowel and 
a following soft consonant. The importance of glides in this connexion was, 
I believe, first pointed out by S. Boyanus. See his article Palatalization of 
Consonants in Juxtaposition in Russian published in the Transactions of the 
Philological Society, 1943. 
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of these words would be 'pe^ris, '^smlt. Again, the fact that the 
s and z of such words as stjix (cthx, verse), 'zdjejnjij (anemmift, 
local), 'sljiva (cjihbb, plum) are soft, is shown by the qualities of 
the preceding vowels in such expressions as na 'stjix (Ha cthx, in 
a verse), tje 'sljivi 4 (Te cjihbh, those plums). The a in na 'stjix is 
a distinctly fronter variety than that in, for instance, na 'sontsi 
(Ha cojiHue, in the sun), and the e in tje 'sljivi is noticeably closer 
than that in tje sta'li 5 (Te ctojih, those tables). 6 

184. So also in Estonian. Such Estonian words as kstjtki 
(cradle) and kettki (even the hand) are recognized mainly by the 
variety of e used and by the j-glide preceding the $. See L. Krass, 
The Phonetics of Estonian, p. 90. 

185. Cases of a similar nature are to be found in Arabic. The 
pharyngalized consonants and the corresponding non-pharyngalized 
consonants of the language are very distinct in sound when said 
in isolation or when adjacent to i or e, but when adjacent to a or o 
or u the difference between the consonants is less clearly audible. 
In these cases the consonant differences are helped out by notable 
differences in the shades of the vowels adjoining them. Thus 
bass (only) and bass (bus) are probably recognized a good deal 
more by the fact that bass has a forward a and bass a back a [a] 
than by the difference in the qualities of the consonants. 

186. An illustration of a different kind has been discovered by 
Professor Ida Ward in the Igbo language. The vowels u and a 
of that language are rather near to each other in sound, and the 
distinction between them is often much facilitated by other vowels 
occurring in the same words with them, as explained in §509. 

(2) Recognition of Sounds through Length of an 
Adjacent Sound 

187. Sometimes a sound is recognized not so much by its 
inherent tamber as by the length of an adjacent sound. 


4 Narrowly [tje 'sljivt]. 

5 Narrowly [tje Sta'li]. 

6 There are also audible i-glides before the S-sounds in na 'stjix, tje 'sljivi. 
Many examples illustrating this point have been given by Boyanus in the 
article mentioned in footnote 3. 
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188. The case is illustrated by the English consonants p, t, f, s, 
etc., and b, d, v, etc., when they occur in final positions. Pre¬ 
ceding vowels are much shorter in words like seat, leaf, loose, than 
in words like seed, leave, lose. The words might be transcribed 
narrowly as [si-t, li’f, lu’s, si:^, Iky, luiz], and I think there is no 
doubt that in such sequences the length of the vowel contributes 
more to the distinctions than the qualities of the consonants do. 7 
(See also §169.) 

189. The means of recognizing such sequences may be tested 
by a process of reversal, e.g. pronouncing groups like [si-d] 
(with half-long i:) and [skt] (with fully long i:), [I by] (with half- 
long i:) and [li:f] (with fully long i:). My impression is that [fi’d], 
[li-v] would be taken by the average English hearer to mean feet, 
leaf. And certainly the incorrect use of fully long i: by Germans 
when they pronounce such English words as feet, leaf gives to 
many English people the impression that they are saying the words 
with d and v. 

190. There cannot be any question in a case like this of assigning 
d and t to the same phoneme and treating the difference in vowel 
length as the essential distinguishing feature. One reason for this 
is that English speakers who use voiceless d in final position as a 
rule use it also in initial position, and the numerous pairs of words 
like den and ten show d and t to belong to separate phonemes. 
Another is that numberless degrees of length of the English "long” 
i: are found in different phonetic contexts, with the result that it 
is impracticable to group them into more than one chroneme 
(§§390, 403). 

191. Whether a final consonant is b, d, etc., or p, t, etc., is 
often shown in English by the length of a preceding consonant. 
Thus fence and pens are distinguished more by the length of the 
n than by the quality of the final consonant; the words may be 
written narrowly [pens], [peniz]. Nevertheless, for reasons similar 
to those given above, it is necessary to regard the s and z as con¬ 
stituting the primary means of differentiating the words. 8 


7 Reference is here made to the pronunciation of the very numerous English 
speakers who do not voice final b, d, V, Z, etc. 

8 It is interesting to notice that the sequence [penis] (with long n and 
normal s) gives the impression of something between pence and pens, while 
[peiiz] (with very short n) gives the impression of pence rather than pens. 
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CHAPTER XIV 

SOME DIFFICULTIES IN ESTABLISHING PHONEMIC 
GROUPINGS 

192. In Chapters X and XI it was shown that cases are sometimes 
found where an appropriate phonemic grouping is not immediately 
clear on account of some difficulty in recognizing the sounds used. 
The present chapter deals with difficulties in arriving at satisfactory 
phonemic classifications when sounds do not fall into obvious and 
natural groups, but have to be “assigned” to phonemes as the 
result of a more or less involved process of reasoning. There are 
two causes of difficulties of this nature: (1) where there is a possibility 
of more than one method of grouping the sounds, and (2) where 
the controlling principles (§30) are complex or difficult to ascertain. 

(1) Possibility of Grouping Sounds in More than One Way 

193. Circumstances occasionally arise in which there are, or 
would appear to be at first sight, two possible phonemic groupings 
of a sound—where in fact we can “assign” a sound to one of two 
phonemes, or where it seems possible either to group a sound into 
the same phoneme as other sounds of the language or to regard 
it as constituting a separate phoneme. 

194. Such possibilities are found in Korean, and may be 
exemplified by the bi-labial sounds occurring in that language. 
There are five non-nasal bi-labial sounds, namely, strongly aspirated 
p (ph), slightly aspirated p (p'), unaspirated (and perhaps glottalized) 
p, unexploded p (pj, and b. An examination of the situations in 
which these five sounds occur shows that they can be classed into 
three phonemes, ph, p' and unaspirated p all occur initially and 
therefore belong to separate phonemes, b occurs only medially; 
ph and unaspirated p also occur medially, but p' does not occur 
medially. We find ourselves therefore under the unusual necessity 
of assigning b to the same phoneme as p', and of denoting this 
phoneme by the letter b. Unexploded p > only occurs finally or 
preceding a consonant, while none of the other sounds occur in 
these positions, p^ may therefore be assigned -to any one of the 
three above-mentioned phonemes. As it has greater affinity to 
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unaspirated p than to ph or p', I think it should be assigned to the 
same phoneme as unaspirated p . 1 

195. Another instance where the appropriate phonemic grouping 
is not immediately obvious is the case of the Pekingese fricative 
consonant which may be written narrowly 0 . In Pekingese 0 
only occurs before i and y, as in / 0 \ (practice), - <aig (star), "isiag 
(box), N gyn (teach), 'eye (learn). There are, however, three other 
fricative consonants occurring solely before vowels other than i 
and y, namely, x, s and J\ 0 may therefore be assigned to the 
same phoneme as one of these. It has certain phonetic affinities 
to each of them, and doubt may well be felt as to which should 
be chosen. It would seem that those Chinese who have considered 
the matter feel 0 to be acoustically nearer to J than to x or s; if 
this is so, the sound may on that ground be assigned to the J- 
phoneme. 

196. But if there were to exist (as is quite possible) varieties 
of Chinese pronunciation in which the degrees of acoustic separation 
of 0 from s and x and J are definitely about equal, how should 0 
be phonemically classified? This would be a case where it would 
be immaterial which phoneme 0 was assigned to. The choice 
could be made either arbitrarily, or by taking into account external 
considerations unconnected with the idea of the phoneme. One 
such external consideration would be the fact that in other varieties 
of Chinese the speakers apparently feel 0 to have special affinity 
to J. Another would be the history of the sounds. This latter 
would point to a different solution, for it is believed that some of 
the g’s of modern Pekingese are derived from an earlier s, while 
others are derived from an earlier x, but that none are derived 
from an old Chinese J. J might then be excluded for this reason. 
But even the history of the sounds does not furnish us with any 
reason for preferring s to x or vice versa. This hypothetical case 
is in fact one where, as far as I can see, no preference is possible, 
and it would be completely immaterial whether the 0 were assigned 


1 The Korean phonetician, Sheon-Gi Gim, thinks it better to regard un- 
exploded p, as a separate phoneme, on the ground that being not plosive at all 
it is not sufficiently "related in character” to justify including it in the same 
phoneme with ph, p‘ (b) or p. See his Phonetics of Korean (thesis for the M.A. 
degree in the University of London). 
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to the s-phoneme or to the x-phoneme; the choice would have 
to be made arbitrarily. 2 

197. Instances where doubt may arise as to the best method 
of phonemic grouping, when the controlling principles offer no 
particular complexity, are also to be found in European languages. 
Notable is the case of the Southern English vowels in words like 
bird, cup and the first vowel of along. Sweet and most other 
English authorities have treated the vowel of bird as belonging 
to the same phoneme as the first vowel of along, but, as will be 
seen from what follows, there is a good case for regarding the vowels 
of bird and cup as belonging to the same phoneme. Paul Passy 
was in favour of so regarding it. 3 

198. That both courses have to be considered is shown by the 
fact that the vowel-sound of bird is always longer than both the 
other vowels in comparable phonetic contexts. Representing the 
quality of the vowel in bird narrowly by b for the purpose of this 
discussion, we may exemplify the case as follows: the b of bed 
[bird) is longer than the a of bAd (bud), the b of het (hurt) is longer 
than the a of hAt (hut), the e of he'beijas ( herbaceous) is longer than 
the first a of ha'bitjual (habitual), the e of 'reiben (Raeburn) is 
longer than the a of 'riban (ribbon), and so on. 

199. If these examples represented the only facts to be con¬ 
sidered. it would be immaterial which phoneme b was assigned to. 
There are, however, other factors to be taken into account. In 
the first place "the vowel of bird” is an ambiguous expression. 
Different Southern English speakers use widely differing qualities 
of vowel in this and similar words, and it may well be that a classi¬ 
fication applicable to one speaker is not necessarily suitable to the 
pronunciation of another. The e of some people is a rather open 
sound though quite distinct from a. Henry Sweet was one of 
these, and I remember well the quality of his vowel. He described 
the sound as “low-mixed-narrow”; 4 it was quite different in quality 
from the a he used in such a word as a 1 log, and distinct too from 

2 Another course would be to regard g as a separate phoneme. It seems, 
however, unsuitable to do this in the case of a sound which so clearly lends 
itself to grouping with another sound. 

3 See his Petite Phonetique Comparee, §249, and the Southern English text 
on p. 127 of the same book. 

4 Sweet, Primer of Spoken English, p. 5. 
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his a which was rather a-like. My e is a much closer sound. 5 It is 
almost indistinguishable in quality from my a in a'lorj, and may 
be regarded practically as this a lengthened. With others the 
quality is near to a. With others again the e is rather lu-like in 
sound; this variety was observable, for instance, in the speech of 
Dr. Temple, the former Archbishop of Canterbury. 

200. The existence of these varieties of pronunciation may 
explain in part the divergence of opinion regarding the phonemic 
status of e, since the authorities undoubtedly do not all refer to 
the same sound. Those whose e resembles a might be expected 
to favour attaching it to the A-phoneme, regarding it as “the long” 
of a, while those who like myself use a sound almost identical in 
quality with the a of a'loQ are naturally disposed to group these 
two together into one phoneme.® 

201. A further complication arises from the fact that Southern 
English speakers use several varieties of unstressed a depending 
upon phonetic context.’ In particular it is customary to use a 
particularly open variety in final positions, as when such words 
as 'tjaina [china), 'ouva [over) occur with a pause after them. 6 7 8 
The existence of these varieties of short unstressed a appears to 
me to tell in favour of assigning e to the same phoneme as these 
a’s, since it is probable that most people’s e coincides in quality 
with one of their short unstressed a’s, just as my vowel of bird 
is practically indistinguishable from my a of a 1 log. It is therefore 

6 Outline of English Phonetics, p. 86. See particularly Figure 61 (photograph 
of the mouth of my brother whose vowel in bird was similar to mine). 

• Sweet, in spite of the openness' of his e, treated the vowel as belonging 
to the same phoneme as a. He wrote it aa. He treated A as "the short" 
of the vowel in calm, writing calm as kaam and come as kam. (He represented 
the ordinary short a by a.) 

7 See Outline of English Phonetics, pp.. 88-91. 

* Some people use a sound indistinguishable in quality from their a in 
words like china, over, discover, colour, when the words are said with a pause 
following ( Outline, §§362, 363). I tend to do it myself when speaking loudly; 
the pronunciation was observable in gramophone records which I made before 
the days of electric recording, when one had to speak very loudly into a horn 
to obtain clear results. In this pronunciation these final a’s, which incidentally 
are often rather long, must be held to belong to the same phoneme as all the 
other a’s. The argument in §201 is concerned with the speech of those whose 
final vowel in china, over, etc., though rather open, is clearly distinguishable 
from A- 
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logical to assign the two sounds to the same phoneme on the ground 
that it is undesirable to allocate one vowel quality to two phonemes. 

202. I am disposed to think then that as long as the speaker’s 
2 is distinct in quality, as well as in duration, from his a, it is 
preferable that we should group it into the same phoneme as the 
short unstressed a’s. (This involves writing 2 as a: in broad 
transcription.) It must be admitted, however, that those who 
favour assigning 2 to the A-phoneme (a classification which would 
involve writing 2 as a: in broad transcription) also have a good case. 
They may adduce the fact that 2 and a are both stable in sound 
in the pronunciation of any given speaker—that there are no 
noticeable variants of either conditioned by sounds adjoining 
them in sequences. They are straightforward sounds to deal 
with; grouping them together as “long and short a” would not 
cause inconvenience, were it not for the fact that 2 is practically 
identical in quality with one variety of a. The English unstressed 
short a’s are on the contrary elusive sounds, difficult to analyse, 
and subject to wide "variations” according to phonetic context. 
It may well be argued that it is convenient to keep them together 
in one phoneme separate from the longer and stressable 2 . It 
may be remarked also that while 2 and a are very frequently found 
in similar phonetic contexts, these sounds rarely occur in the same 
phonetic context as a. 9 These arguments, however, do not seem 
to me to be of sufficient cogency to outweigh the general principle 
that one sound-quality should not be assigned to two phonemes 
if it can possibly be helped. (See §35, also Chaps. XIX and XX.) 

203. There are other types of English in which doubt may be 
felt as to the best phonemic classification of a and a for reasons 
different from those given in §§197-202. I refer to the pronuncia¬ 
tion of those whose speech does not contain a long a: at all, but 
who pronounce words like burn, work either with a sequence aj 
or with a monophthongal retroflexed or j-coloured a, thus bajn, 
wajk or bjn, wjk. Pronunciations of these types are common 
in the West and North-West of England and in America. 10 


* Some would say that they never occur in the same phonetic context as 
a —that “weakly stressed” 2 and A in reality have a semi-strong stress. 
(See §204.) 

10 The usual American j has a somewhat different quality from the Western 
English J. 
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204. As in these types of English neither a nor a occurs long, 
the question arises whether the two sounds constitute separate 
phonemes or whether any grounds can be found for grouping them 
together in one phoneme. Personally, I take the view that both 
sounds occur in comparable phonetic contexts, such as those 
mentioned in §148, and therefore constitute separate phonemes. 
The opinion has, however, been expressed 11 that these contexts 
are not comparable, and that every so-called weakly stressed a 
has in fact a semi-strong stress. Those who hold this opinion 
appear to consider that not only a but also the English e, a and o 
sounds are incapable of being uttered with weak stress. It seems 
to me to be difficult to maintain such a proposition. To me it 
appears more logical to postulate that any vowel-sound can be 
pronounced with any degree of stress. The opener vowels, it is 
true, generally have greater prominence than sounds of the types s, 
i and u; but this is probably accounted for by their greater inherent 
carrying power, 12 and sometimes also to extreme shortness of the 
a, i and u sounds. Non-final a is generally elidable in British 
English, i.e. its elision does not as a rule affect the meanings of 
words. Therefore its precise quality is, within limits, immaterial. 
Weakly stressed e, a, o and a, on the other hand, are not elidable; 
they have to be long enough to make their qualities distinctly 
recognizable. They are thus more prominent (by length) than a, 
but this does not necessarily mean that they have stronger stress. 

205. I have explained elsewhere 13 that I take stress to be 
largely a subjective feeling of forcibleness or effort in the action 
of producing sounds, and I may add here that I am doubtful if 
it will ever be found possible to determine degrees of stress by 


11 E.g. by G. L. Trager and B. Bloch. (See Trager and Bloch, The 
Syllabic Phonemes of English in American Speech, Vol. XVII, No. 3, July, 
1941, p. 227, and tbe transcriptions by Trager in Le Mattre Phonilique, 
January, 1935, p. 11, and April, 1941, p. 17; also the note by Bloch in Le 
Mattre Phonitique, January, 1943, p. 4.) 

12 The term "inherent carrying power” has been objected to by some on 
theoretical grounds. I submit, however, that it is a justifiable term to use 
in linguistic work, since it is a matter of experience that close vowels can only 
be made to carry as well as opener vowels by uttering them with greater 
effort (e.g. in attempting to get comparable tone-volumes when making 
gramophone records of vowels). (See also §437.) 

13 §425 of this book. Also Outline of English Phonetics, §909 and footnote. 
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laboratory methods. 14 Certain other features contributing to the 
prominence of syllables can be investigated by laboratory methods, 
and notably duration and, to some extent, intonation 15 ; and I am 
disposed to think that what is taken by some authorities to be 
“semi-strong stress” on a and certain other English vowels is in 
point of fact merely prominence effected by these other means. 
I would add further that in my speech there are constant relations 
between intonation and what I judge to be stress, and that the 
intonations used on syllables containing “weakly stressed” a, a, 
etc., are the same as those applied to similarly placed syllables 
containing a. 

206. Even if it could be proved that all the a’s which I take 
to be weakly stressed have in reality a semi-strong stress, it is 
still not certain that it would be advantageous to utilize this fact 


14 As far as I am aware the experiments carried out by J. W. Jeaffreson 
constitute the nearest approach yet made towards the determination of stress 
by laboratory methods. These experiments were based on the assumption 
that degrees of stress are proportional to the amount of depression of the 
lower jaw, allowance being made for the fact that the amount and manner of 
lowering differs for different vowels. (Jeaffreson claimed that this is not 
an assumption but a proved fact. Incidentally, he was able to recognize 
certain vowels by the shapes recorded by their jaw-movements.) As the 
Southern English A is normally pronounced with considerable lowering of 
the jaw, Jeaffreson’s experiments apparently tend to support the theory 
of Trager and Bloch. On the other hand, although Jeaffreson’s premise 
concerning the relationship of stress to jaw-movement may hold good in 
most cases, we should not ignore the fact that it is possible to give strong 
stress to sounds said with closed or nearly closed jaws. How far strongly 
stressed sounds of this description actually occur in speech I am not prepared 
to say, but the fact that they can be made cannot but cast a certain doubt 
on the validity of Jeaffreson’s theory. 

Only a few of the results of Jeaffreson’s remarkable experiments have 
ever been published. These appeared in a paper entitled Stress and Rhythm 
in Speech in the Transactions of the Philological Society, 1938, a work which 
deserves close study. Many other results were incorporated in his Mensuration 
of French Verse (thesis for the London M.A., 1924) which has never been 
printed. This valuable work may be consulted in the Library of the University 
of London. 

15 In my experience the actual intonations revealed by laboratory methods 
often do not correspond either to the intention of the speaker or to the per¬ 
ception of the hearer. For example, the effect of a fall from high to low 
pitch is often produced by an intonation which can be shown by laboratory 
examination to be only a fall from a high to a mid pitch. 



ESTABLISHING PHONEMIC GROUPINGS 61 

for grouping a and a into one phoneme. The two sounds would 
be distinguished by a complex of quality and stress, and doubtless 
also of length, neither of these attributes predominating decisively 
over the other. When such is the case, either attribute can be 
regarded as the primary means of differentiation and the other 
considered subsidiary (see §491). Our choice might therefore be 
determined by some irrelevant external consideration. Such a 
consideration is the appearance and legibility of phonetic tran¬ 
scription. If the degree of stress were taken to be the primary 
means of differentiation, with the result that the letter a could 
be dispensed with, we should have to use a mark (|) to indicate 
semi-strong stress. The words 'hikAp, 'hAmdrAm, 'katapAlt would 
then be written 1 hi ( kap, 'ham |dram, 'kataipait, and the other three 
words quoted in §148 would be written 'sirap, ka'nandram, 'difikalt. 
From the point of view of legibility I believe this system would be 
inferior to the equally justifiable system of using the separate and 
easily distinguishable letters a and a. 

207. Another solution of the problem would be to regard 
the difference of length between a and a as the primary means 
of differentiation. We might in fact either write all a’s of the 
type of English under consideration as a with a length-mark 
or all non-final a’s with a mark indicating extreme shortness. 
The first of these alternatives would be inconvenient, as it would 
mean using a length-mark in the representation of a sound which 
is not longer than the other short vowels. The second alternative, of 
writing say a in place of every ordinary a, would impair both legibility 
and ease of writing, owing to the commonness of the sound. 18 

208. I regard it as necessary to bear in mind continually in 
phonemic investigations that one of the main objects of the phoneme 
theory is to establish a basis for the elaboration of simple and legible 
forms of writing (Chap. XXXI). Consequently, though I advocate 
the principle of writing by means of the bare minimum of letters, 
I feel that in the case we are now considering the best solution is 
to use the two letters a and a. 

209. The foregoing discussion (§§197-208) relates to types of 
English in which a and a are clearly distinct in quality—where 
the two sounds said in isolation with identical stress and length 
are easily distinguishable. Other types exist (especially in the 


18 It occurs approximately 40 times in every 100 words. 
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North of England and in America) in which a is very near in sound 
to a, and in these the Trager-Bloch method of phonemic classifica¬ 
tion would presumably be the only appropriate one . 17 

210. The following is another example, this time from a European 
foreign language, where doubt might at first be felt as to the proper 
phonemic allocation of a sound. It is the case of the Italian final 
weakly stressed o in such words as 'ekko ( ecco ), 'kwattro ( quattro ). 
In the pronunciation of many Italians this vowel is a rather open 
one, nearer in quality to the 3 of 'skwola ( scuola ), 'notte (notte) 
than to the close o of 'dove, 'sopra. Representing this final vowel 
narrowly by the symbol 9 for the purpose of this discussion, we 
may express the case thus: 9 is acoustically nearer to 3 than to o, 
and 9 is always weakly stressed while 0 is always strongly stressed 18 ; 
such a state of affairs naturally tempts a foreign observer to assign 
9 to the o-phoneme. Further observations show, however, that 
this allocation cannot be maintained. For if we examine a number 
of words containing weakly stressed non-final o’s, such as do'mani, 
dot'tore, kon'tsnto, or'ribile, for'tissimo, 'ekkomi, 'kredono, we 
find that a whole series of vowel-sounds exists ranging by imper¬ 
ceptible degrees from nearly close o to the comparatively open 9 
of final positions. In this series it does not seem possible to draw 
any definite line of demarcation which would justify assigning 
the final 9 ’s to a different phoneme from the other unstressed o’s. 
The situation is probably about as shown in the accompanying 
diagram (Fig. 5). Such being the case, it is necessary to assign 9 
to the o-phoneme . 19 

17 Many people in the North of England and in America do not have any 
A-sound in their speech. Where Southern English has A, they use an 3 of 
the same quality as the ordinary weakly stressed 3 ; for instance, they 
pronounce cup as ksp. The above discussion has naturally no application 
to this pronunciation. 

18 Unless we take sentences into account, which is contrary to the principles 
which in my opinion should be followed in investigating the nature of the 
phoneme (see §§31, 34, and Some Thoughts on the Phoneme, pp. 127-132). 

19 If in the pronunciation of some Italians there is a clear line of demarcation 
between the quality of non-final unstressed o’s and that of final 9 , it would 
mean that 9 need not necessarily be attached to the o-phoneme in their 
speech. The case for attaching it to o would be no better than the case for 
attaching it to 3 . We should have to choose one of them. This could be 
done purely arbitrarily, or by taking into consideration some external fact 
unconnected with the idea of the phoneme. Such a fact would be the history 
of the sound, which would naturally lead us to prefer the o-pnoneme. 
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211. It may be mentioned here that in some cases of apparent 
difficulty the investigator should take account of the “linguistic 
sense” of speakers of the language under examination. I believe 
it to be a fact that the instinctive feeling of a native whose speech 
is unstudied is at times a better guide than the reasonings of the 
phonetician. Thus a phonetic expert might be disposed to reason 
that the Italian and Spanish g should be assigned to the ji-phoneme 
since the tongue position of g is nearer to that of ji than to that 
of n. But the “linguistic sense” of Italians and Spaniards leads 
them to consider g as "a kind of n” and not as “a kind of ji.” If 
a reason is assignable, it is no doubt that ji is always completed 
with an off-glide (a rapid j); g has no off-glide in Italian and Spanish, 
and is therefore felt to be nearer to n which also occurs without an 
off-glide. 



212. Sometimes the natural “linguistic sense” of the native is 
difficult to ascertain, owing to prejudice created by orthography. 
A case in point is the labio-dental rg which is used in Italian to 
the exclusion of other nasal consonants before f and v, as in 
[■nirgfa] ( ninfa ), [irrj'vetje] ( invece ). The circumstances under 
which this sound is used show that it must be grouped into the 
same phoneme as one of the other nasal consonants. Some might 
be inclined to assign it to the n-phoneme, but I think that this 
feeling would be mainly, if not entirely, due to the influence of 
the current orthography. I believe that if orthography could be 
disregarded, the natural linguistic sense would be found to be 
that the sound is “a kind of m.” This is borne out by the fact 
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that children who have this pronunciation in English are liable 
to write envelope as emvelope. On acoustic and organic grounds 
I think too that the Italian n] should be assigned to the m-phoneme. 
It is worthy of note also that the same sound occurring in Greek 
is written with p, as in 'simfoni [ l sirgfoni] (o\/p<pcovoi, agreed). 

213. Further instances of the possibility of more than one 
method of phonemic grouping are given in Chapter XX. 

( 2 ) Complexity of Controlling Principles 

214. The case of the French sound (or rather the set of shades 
of sound) known as "e moyen” or “middle e” affords an illustration 
of difficulties in establishing a phonemic classification by reason of 
complexity of “controlling principles” (§30). It also illustrates the 
importance of avoiding the introduction of non-phonetic criteria. 20 

215. Usage varies very much with different French speakers, 
but for the purpose of the present enquiry we will examine the 
following distribution of the various e and £ sounds, which is, I 
believe, not uncommon in Northern France. (Middle e is repre¬ 
sented narrowly in what follows by the symbol §.) 

(1) Close e (near to cardinal e) occurs in final open syllables: 
te {the), ne (nez), Jate {chanter), ke {quai), etc. 

(2) Middle $ (various shades) occurs in non-final syllables: §kla 
{eclat), s(jde {cider), r^p^te {repeter), b^tiiz {bitise), d<|sa:dr 
{descendre), {effet), etc. 21 

(3) Open e (about cardinal) occurs 

{a) in final open syllables: ne {natt), ob 3 E (objet), vre {vrai), 
etc. 22 ; 

(6) in final closed syllables long and short: me:m {mime), 
meitr (mattre), te:t {tite), metr {mettre), det {dette), Jef 
{chef), etc. 


20 In §§215-229 most of the words quoted as examples are written in a 
narrow transcription. I have not thought it necessary to enclose them all 
in square brackets as is done in most other parts of the book (§46). The 
few cases in which the transcriptions are broad are evident from the context. 

21 Some speakers also pronounce with middle § some or all of the words 
here grouped under 4 (6). 

22 Some Parisians use close e in all or many of these words. 
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(4) Open e (cardinal or slightly less open) occurs 

(a) in non-final closed syllables: vest5 ( veston ), ermit (ermite), 
ekta:r [hectare), ekski [exquis), preskri [present), Espais 
[espace), enmi [ennemi), etc. 23 ; 

[h) in non-final open syllables: emabl [aimable), pkzi:r 
[plaisir), erceir [erreur), mele [mSler), ede [aider), etc. 24 

216. Basing our enquiry on the above distribution of the e and 
e sounds, we find that there are undoubtedly two phonemes and no 
more. These we represent naturally by the letters e and e. Further 
we find that the middle. ?'s must be assigned to the e-phoneme 
and not to the E-phoneme. 26 For it will be seen from §215 (1) and 
(2) that middle § only occurs in situations where close e does not 
occur; whereas a comparison of (2) and (4) shows that middle % 
may occur in the same situation as open e. 

217. It follows from this that the distinction between middle $ 
and close e can never be significant, but the distinction between 
middle $ and open e is capable of being significant. 2 * 

218. The fact that there is so much variation in the pronuncia¬ 
tion of different French speakers (see footnotes) illustrates well the 
necessity for establishing phonemic classifications from the speech 
of one individual, and taking no account of any other. 

219. It will not do, for instance, to say that “the middle £ 
used by those who pronounce ^mabl must belong to the £-phoneme 
because other people pronounce Emabl.” Such a manner of 
reasoning would cause the collapse of the whole theory of phonemes. 
We must simply observe that people do not all use the same 
phonemes in the same words. 

220. Other non-phonetic criteria which are inadmissible are 
“root forms” or “the vowel which the syllable would have if 
strongly stressed.” For example, the fact that b^tiiz [bitise) may 
be supposed to be a derivative of a root form be:t [bite) does not 

** Some speakers use middle ^ in some of these words, e.g. ^ktair, §kski, 
£spa:s, £nmi, ^skimo. 

14 Many speakers use middle § in some or all of these words: §mabl, pl^zi:r. 
mqle, ?de. 

15 Subject to the particular case noted in §§307, 312. 

13 Example: ^Iboef [et le baeuf), clbcef ( Elbeuf ). 
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provide a reason for assigning the % to the e-phoneme. This is 
simply a case where a derived form is said with a phoneme different 
from that of the root form. If we were analysing the pronunciation 
of those who pronounce triple, §de, etc., many further examples 
would be found, since m§le, $de, etc., are pronounced by those 
speakers with phonemes different from those in mE:l {mile), £:d 
{aide), etc. 

221. Similarly the fact that some speakers pronounce est (is) 
as £ in strongly stressed positions and q in weakly stressed 
positions 27 does not mean that this § belongs to the £-phoneme 
in their speech; it is merely an instance of a word changing its 
phoneme according to the phonetic context. 28 Mats is another 
such word: some speakers pronounce ms when a pause follows, 
but m? in m§n5 {mais non), etc. 29 

222. Two further facts that might strike some observers as 
constituting difficulties are (1) that there may be “overlapping” 
of the French e and £ phonemes and (2) that some French people 
have erratic pronunciation of particular words in a given context. 
These points are dealt with in Chapters XIX and XXVIII, from 
which it will be seen that they do not affect the conclusions arrived 
at here. 

223. The phonemic classification of the German e, e and a 
sounds presents features of interest owing to a certain complexity 
in the controlling principles, and to the fact that conclusive examples 
are rare. 

224. In the type of German described in Vietor’s Aussprache- 
worterbuch these sounds are used in phonetic contexts as follows: 

e. 30 Always long or half-long. Generally strongly stressed or 
medium stressed, but also occurs weakly stressed. Examples: 
(strongly stressed) 'e:ra {Ehre), 'ne:man ( nehmen ), fer'keir 
( Verkehr), ga'beit {Gebet), (medium stressed) e:vag'ge:lij 
(i evangelisch, first e:), 'anganeim {angenehm), 'zygkoipe: 
{Synkope), (weakly stressed) e:'IcktriJ ( elektrisch ). 

27 As in wi, il 1 £; n5, i n I § pa (oui, il Vest) non, il ne Vest pas). 

28 Just as the English word is is pronounced Z and S according to the 
phonetic context (e.g. hi: z 'gouig, it s 'gouig). 

22 This was Paul Passy’s pronunciation. 

20 Narrow transcription is used in the course of this explanation. 
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e. Occurs long and short. Both can have any degree of stress, 
but weakly stressed long e is uncommon. Examples: (long, 
strongly stressed) 'E:ra [Ahre), 'Einlig ( ahnlich ), (long, medium 
stressed) Eikvii'noktsjum [Aquinoktium) , (long, weakly 
stressed) Ei'rair (Arar), dEi'moinan ( Ddmonen); (short, 
strongly stressed) 'geltan ( gelten ), 'besar [lesser), 'hEnda 
(Hande), (short, weakly stressed) ber'liin [Berlin), per'zoin 
[Person), tnt'zetslig [entsetzlich), 'aletj [Aletsch), ‘eilent [elend), 
'eglsndar [Englander), mer'tseides [Mercedes). 

a (an opener variety before r and a closer variety elsewhere 31 ). 
Is always short and weakly stressed. Examples: 'bita [bitte), 
'oina [ohne), ga'vis [gewiss), ba'axtug [Beachtung), 'bitat 
[bittet), 'untar [unter), 'hundart ( hundert), 'mindastans 
[mindestens), 'feilant [fehlend), 'feilandar (fehlender), 'a:bants 
[Abends). 32 

225. An examination of the foregoing examples shows that, 
in the style of speech under consideration, e and e belong to separate 
phonemes, and that a is assignable to the same phoneme as e. 
That e and e belong to separate phonemes is shown by pairs such 
as 'e:ra, 'cira, ‘zeian [sehen), 'z£:an [sden). That e and a belong 
to separate phonemes is shown by pairs such as 'eilent, 'aibant, 
'Erjlendar, 'feilandar. 33 That a is assignable to the e-phoneme is 
shown by the fact that in comparable phonetic contexts e is always 
longer than a; the e of 'zygkoipei [Synkope) is longer than the 
final a of 'zigkanda [sinkende), and the e of ei'lektrij is longer than 
the a of ga'lekt [geleckt). 

226. In broad transcription we should therefore write 'e:re 
[Ehre), 'neimen, ei'lektrij, 'Eire [Ahre), 'cinlig, 'gelten, 'beser, 
'hende, ber'liin, 'eilent, 'bite, 'oine, 'mindestens, etc. 

227. The phonemic status of these sounds is different in the 
speech of the numerous North Germans who do not use long ei, 
but who pronounce such words as ahnlich, Ahre with long close ei. 

31 Reference is here made to the pronunciation of those who sound vowel 
plus r. (See footnote 13 to §91.) 

33 In some styles of speech a is also used in certain pretonic closed syllables, 
e.g. in words beginning with the prefixes er-, ver- (ar-, far- instead of Er-, fer-). 

33 Other examples occur in the speech of those who use a in prefixes like 
er-, ver- (footnote 32 above), e.g. bfir'liinar, far'liiran. 
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In their speech z cannot be regarded otherwise than as a member 
of the same phoneme as e, since £ is always short and e always 
comparatively long, a, on the other hand, must be held to con¬ 
stitute a separate phoneme in this type of German, since differences 
like those in 'edcnt and 'feitant are not attributable to phonetic 
context. 

228. A broad transcription of this type of German would not 
need the letter z, but it would require the use of a. We should 
write 'errs (for Ehre and Ahre), 'ne:man, ei'lektrij, *e:nli$, 'geltan, 
'beser, 'henda, ber'liin, 'eilent, 1 bits, 'feilant, 'o:na, 'mindastans, 
etc. 

229. A curious point to note is that words like elend, containing 
weakly stressed £ in post-tonic syllable, are few in number and are 
seldom used. If they did not happen to exist, and if names were 
excluded from our enquiry as not belonging to the language proper, 
the three sounds e, e and a would apparently, in one type of German, 
be assignable to a single phoneme. (The type in question is the 
speech of those who pronounce dhnlich, Ahre, etc., with long close 
e:, and who use z in the prefixes er- and ver-.) One might perhaps 
make out a case for ignoring such a word as elend on the ground 
that it does not belong to the normal spoken language. Whether 
proper names should or should not be taken into account in ascer¬ 
taining the phonemic structure of a language is a question that 
must for the present be left open. It is one which needs investiga¬ 
tion, since proper names sometimes contain sound-sequences alien 
to ordinary words (cf. §§284, 298). 

230. There is some uncertainty as to whether the German 
consonants x and g belong to a single phoneme or constitute separate 
phonemes. The conditions under which these sounds are normally 
found are clear enough. They are as follows: (1) x only occurs 
when an a, o or u sound precedes in the same syllable, (2) g occurs 
in all positions other than these, namely (a) at the beginnings of 
syllables, (b) when a front vowel precedes in the same syllable, and 
(c) when a consonant precedes. Examples are: (1) bax (Bach), 
Jpra:x (sprach), dox (dock), ho:x (hoch), brux (Bruch, breakage), 
bu:x (Buck), (2) (a) ijei'mi: (Chemie), 'gryizailis (Chrysalis), ar'gaiij 
(archaisch) and all the words ending with the suffix -ijan (-chen), 
(b) i 5 (ich), 'tygtig (tiichtig), r£$t (recht), (c) mil? (Milch), dur$ 
(durch). 
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231. It will be seen from these conditions that it is possible 
to group 5 and x into a single phoneme if we know where the 
beginnings and ends of syllables are. This is often a matter of 
word-division and sometimes a question of knowing whether a 
certain sequence of sounds is or is not a prefix or a suffix. Now 
North Germans appear to have a particularly definite feeling for 
word-division, and they seem to feel various prefixes and suffixes 
as word-like entities. 34 In particular, they seem to feel the suffix 
-gan (-chen) to be a sort of “word.” They seem to take it for 
granted, for instance, that the x of 'rauxan ( rauchen ) terminates 
a syllable and that the 5 of 'fraugan ( Frauchen ) begins a syllable; 
in other words, they appear to have a clear perception of the 
morphological structure of the words. If, therefore, we consider 
the suffix -chen to be a word, our case for regarding x and 5 as 
members of a single phoneme is all but complete. 35 

232. It would be quite complete were it not for the existence of 
the exceptional word va'xolder ( Wacholder , juniper) and a few proper 
names such as va'xau ( Wachau , district in Austria) in which x 
appears to begin a syllable. A case can, I think, be made out for 
excluding proper names from the materials upon which we base 
phonemic grouping in many languages (cp. §298), but it does not 
seem to me justifiable to ignore a reasonably common word like 
va'xolder in order to be able to establish a convenient phonemic 
system. For a similar reason I doubt the feasibility of the ingenious 
proposal made by J. L. M. Trim in Le Maitre Phonetique, July, 
1951, p. 41, to assign 5 to the same phoneme as h. The German 
words of Greek origin beginning with 5 are surely too numerous 
to be ignored in a phonemic account of normal German. 36 

34 It will be recalled that in my view a system of phonemes must be based 
upon the pronunciation of isolated words and not upon connected speech 
(§34). See also Some Thoughts on the Phoneme, pp. 127-132. 

35 The theory that German x and 5 belong to a single phoneme was, I 
believe, first propounded by L. Bloomfield (see Le Maitre Phonetique, 
April, 1930, pp. 27, 28). 

36 I have heard from several sources that Wacholder is pronounced with a h 
in it (vax'holder) by some Germans. If this pronunciation could be taken 
as the standard form in this and ail similar words, the establishment of X 
and q as members of a single phoneme would apparently be demonstrated. 
As matters stand, however, the evidence in favour of the pronunciation 
vax'holder is insufficient. Va'xolder is the only form recorded in Siebs’ 
Hochsprache and in Vietor’s Ausspracheworterbach. 
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CHAPTER XV 

GLIDING SOUNDS AND COMPOUND SOUNDS 

233. “Gliding sounds” and “compound sounds” (§§9-19), 
though audibly divisible into parts, have to be regarded for 
linguistic purposes as single entities. Like other sounds they 
belong to phonemes, and may occur either as principal or as sub¬ 
sidiary members. They are sometimes found in the same phonemes 
as "simple” sounds. 

234. In the following paragraphs examples are given of phonemes 
comprising gliding sounds and compound sounds. In §§243-257 
and §§270-272 will be found examples of simple sounds occurring 
in the same phonemes as gliding and compound sounds. 

Gliding Sounds 

235. The English w is a phoneme the members of which are 
all gliding sounds. They are all formed by starting at or near 
an u position and immediately moving away in the direction of 
some other vowel. Thus the w of wet starts near the English 
short u (as in huk, hook ) and immediately moves away in the 
direction of the e. The w of wo:k (walk) is different; it starts 
with a somewhat closer lip-position than the w of wet, and the 
gliding sound from this to the o: is evidently different both in 
tongue movement and in lip movement from the previous w. 
The w of wu: (woo) is again different; it starts with a much closer 
lip-position, and the passage from the initial position to the u: 
involves only a very slight movement. 

236. The English j is a gliding sound showing similar charac¬ 
teristics. The initial position of the tongue is much lower in 
ja:d (yard) from what it is in ji:ld (yield). The nature of these 
two members of the phoneme is shown in Figure 6. Although 
these two sounds are so very different in formation, it would never 
occur to any ordinary English person that they were anything else 
than "the same sound”—meaning members of the same phoneme. 

237. Diphthongs 1 must in my opinion be considered as indivisible 
gliding sounds. It is convenient to represent them in writing by 

1 For definition of the term “diphthong” see my Outline of English Phonetics, 
8 th edition, 1956 (Heffer, Cambridge), §219 ff. 
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digraphs, the first letter representing the starting point and the 
second indicating the direction of movement. In the following 
paragraphs are given some examples of diphthong phonemes which 
have subsidiary members differing notably from the principal 
members. 



238. One is the English ei. 2 In my speech the principal member 
is a sound gliding approximately from a sound near to my e of beg 
towards my i of big. This sound is used in final positions, e.g. in 
plei (play) when it occurs at the end of a sentence. When a 
consonant follows, closer varieties are generally used. Thus my 
ei in geim (game) has a higher starting point than my ei in plei. 
Before some consonants, and especially before t, s and J, a still 
closer variety of ei is used; examples pleit (plate), pleis (place), 
■steijn (station). 

239. A noteworthy instance of variant sounds which would 
appear at first sight to be two sounds belonging to a single diphthong 
phoneme is afforded by the sounds which may be written Ai and 
ae in common varieties of Scottish pronunciation. For instance 
in one type of Scottish, described in my book The Pronunciation 
of English (Cambridge University Press, 4th edition, 1958, p. 58), 
Ai is used to the exclusion of ae when one of the consonants p, b, 


a My pronunciation. These observations do not apply to the nearly 
monophthongal ei (approaching El) which is now very prevalent in the 
South of England. 
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t, k, g, m, n or I follows in the same syllable. Further examination 
of the occurrences of Ai and ae (see §522) shows, however, that 
the idea of grouping the two diphthongs into a single phoneme 
cannot be maintained. 

240. The above are examples where different members of a 
diphthong phoneme are conditioned by adjoining sounds in the 
words. Special members of diphthong phonemes may also be 
used in conjunction with particular degrees of stress or in con¬ 
junction with particular tones in tone languages. 

241. Examples of special variants conditioned by stress are 
found in the speech of some English people whose ai and au phonemes 
are represented by sounds of the types ai and au in syllables with 
weak stress. Henry Sweet pronounced in this way, and in some 
of his transcripts he used the narrow notations ai, au to denote these 
subsidiary members. He wrote for instance mistifaid, and haueva (for 
mystified, however), but faiv, sapraiz, haus (for five, surprise, house). 3 

242. Examples of diphthongal variants used in conjunction 
with special tones are found in Pekingese. For instance, in that 
language distinct members of the au-phoneme are associated with 
different tones. In particular the diphthong used in association 
with the third tone, as in / xau (good), is a notably lower variety 
than that used in words with other tones, e.g. "xau (plant). It 
might be represented narrowly by ao. 

Gliding Sounds and Simple Sounds as Members of the 
Same Phoneme 

243. Gliding sounds and simple sounds sometimes occur as 
members of the same phoneme. An example is found in the 
French j. The common members of this phoneme are gliding 
sounds similar to the English j-sounds described in §236. Thus 
gliding sounds are commonly used in jo ( yeux), jad ( iode), mjet 
( miette ), Ij5 (lion), kaje ( cahier), vjejarr ( viellard). In certain final 
positions, however, and particularly when the preceding vowel is i, 

3 Sweet, Primer of Spoken English, p. 78. In his Elementarbuch des 
Gesprochenen Englisch he wrote these sounds broadly, using ai, au in weakly 
stressed as well as in strongly stressed positions. The use of diphthongs of 
the 9|, au types in weakly stressed positions appears to be less prevalent 
in the Southern English of to-day than in Sweet’s time. 
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Northern French speakers commonly use a simple sound, fricative j. 
It is usual, for instance, to hear such words as firj ( fille ), bri:j ( brille) 
pronounced with fricative j. There is no reason for considering 
this fricative sound as other than a special member of the French 
j-phoneme. 

244. Other instances of simple sounds belonging to the same 
phonemes as gliding sounds are found in the case of various phonemes 
which have diphthongs as their principal members. 

245. The case may be illustrated by the English ei-phoneme. 
Besides the diphthongal members mentioned in §238 it comprises 
a monophthongal member which is used when a or 1 follows, as 
in pleia ( player) or feil (fail).* That the sound is monophthongal 
or approximately so is fairly easily heard in the group eia. The 
monophthongal character is less noticeable in the sequence eil, 
but it may be demonstrated by whispering the syllable and listening 
to its whisper pitch. In ei, said alone, the whisper pitch is heard 
to ascend 5 ; in eil it remains about stationary until it descends for 
the I. 8 The monophthongal variety of e heard in these syllables 
must- evidently be held to be a member of the phoneme which 
has the diphthong ei as its principal member. It is apprehended 
by speaker and hearer as being “the same sound” as ei, i.e. it is 
the variety of ei which it is natural to use in these special phonetic 
contexts. 

246. In broad transcription the words player and fail are written 
pleia, feil. To show in writing the monophthongal character of 
the sound written ei would necessitate introducing a special symbol, 


4 Here again I refer to my own speech, which is I believe similar to that of 
a considerable proportion of Southern English speakers. 


In my case from about A 


8 va -. A 

p= ,oB 4 




*J In my case A 


m 


descending to about 



The 


fact that the whisper pitch descends in the English syllable eil was first 
discovered by W. Perrett; see his Some Questions of Phonetic Theory, Part I, 
p. 32 (1916). 
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say e, to denote the short e-sound of let, fel {fell), etc. Play, 
player, pair, let, fell, fail could then be represented more narrowly 
by the notation [plei, plea, pea, let, fel, fel (or fe:l)]. The trans¬ 
cription of pair could be still further narrowed by marking the e 
so as to show that it is an opener sound than the vowel of fell, 
thus: [fel, p^a]. 8 

247. The treatment of the English diphthong phonemes ai and 
au when followed by a may be cited here in connexion with the 
subject of monophthongs in diphthong phonemes. 

248. In the speech of many English people words like fire and 
hire are pronounced with a sound which may be written aa. This 
is probably best regarded as a diphthong, though it may be taken 
to be a sequence of the two sounds a and a. 9 If it is held to be a 
diphthong, it constitutes a separate diphthongal phoneme. If it 
can be treated as a sequence of an a-sound + a, the a may either 
be assigned to the ordinary a-phoneme, which comprises the vowels 
in kat {cat) and ha:f {half), or it may be considered as a monophthongal 
member of the ai-phoneme. The only reason for suggesting the 
latter course is an external one unconnected with the phoneme 
theory, namely, that other speakers use pronunciations of the 
types aea, aea in such words—that there exist in fact with different 
speakers various shades of pronunciation extending from a clearly 
disyllabic aea to a monosyllabic or barely disyllabic aa. With 
those whose pronunciation shows any trace of tongue raising 
between the a and the a, the first part of the sequence (ae or at) 
cannot be treated as other than a member of the ai-phoneme. 
There is therefore a certain practical convenience in treating 
similarly the pronunciation of those who have a “level” pronuncia¬ 
tion aa. 10 

249. Similar considerations apply to the English diphthong au. 
Some speakers pronounce words like tower and hour with a distinct 
disyllable which could be written narrowly aoa or aaa. The ao 

8 Even so we should fall short of strict representation of one speech-sound 
by one symbol. The vowel of fell is opener than that of let, but less open 
than the initial part of the diphthong in pair. Stricter adherence to a “one 
sound one symbol ” principle would involve some such system of representa¬ 
tion as fel (fail), let (let), fgl (fell), pasa (pair). 

8 It is difficult to distinguish between the diphthong and the sequence. 

10 For the occasional possibility of assigning a sound to a particular phoneme 
on grounds unconnected with the phoneme theory, see §§322 ff. 
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or as in this form of pronunciation must without doubt be considered 
as a subsidiary member of the au-phoneme. Other speakers use a 
“levelled” form which is nearly or quite monosyllabic. Its value 
varies considerably according to the speaker’s dialect, but it is 
more often a backer sound than the as corresponding to ais. This 
form may consequently be represented phonetically by as. The 
a in this pronunciation may be regarded as a monophthongal 
member of the au-phoneme. Alternatively the a a may be con¬ 
sidered as constituting a separate diphthongal phoneme. 11 

250. Some English speakers who pronounce fire and hire as 
faa, haa use a disyllabic pronunciation with aia in the words having 
the suffix -er, e.g. 'haia (higher), 'flais (flyer), 'Jaia (shyer). In this 
style of speech the initial part of aa evidently cannot be considered 
as having anything to do with the ai-phoneme. If the aa of this 
style of speech sounds disyllabic, the a may be assigned to the 
ordinary English a-phoneme. If the aa is definitely monosyllabic 
it must be held to constitute a separate diphthong phoneme [aS]. 
Similar considerations apply to such words as plougher, allowance. 

251. The West of England pronunciation of “long o” gives us 
a further illustration of the occurrence of a diphthong and a pure 
vowel in the same phoneme. Before j the sound used is with 
many Western speakers monophthongal: mo'j (more), 'pouiq 
(pouring), fojs (force), etc. 12 But in other situations a diphthong 
of the ou type is used: sou (so), oun (own), etc. 

252. It has been suggested that the so-called monophthongal 
o-sound often heard from English speakers in such words as molest, 
obey, profound, November, should be regarded as a member of the 
ou-phoneme. If it can be established that there are speakers who 

11 The phonemic status of the 'Tevel” forms of aia and aus presents a 
singularly difficult problem. The values of the sounds vary from one speaker 
to another, and the differences of classification depend upon the starting 
points of the speaker’s ai and au and upon the value of his long a:. The 
problem is complicated by the fact that many people use a still more ‘’level" 
pronunciation than as and as, viz. varieties of long ai, in such words as 
fire, tower. These generally differ somewhat from the long a: of half as well 
as from each other. Moreover, the pronunciation of one and the same 
speaker may vary according as he is speaking rapidly or slowly. Con¬ 
sequently people’s pronunciation of such words is often erratic and cannot 
be reduced to phonemes. Some further information on the subject is 
given in my Outline of English Phonetics, §§414, 415, 430, 431. 

18 Also pronounced mosj. ’posjil], foSJS, etc., by some. 
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use a really monophthongal sound which they employ to the 
exclusion of ou in definite phonetic contexts, e.g. in pretonic 
position, then it furnishes us with yet another example of the 
grouping of a simple sound and a gliding sound into one phoneme. 

253. In my opinion, however, the conditions under which this 
o-like sound is used are not such as to make it groupable into the 
same phoneme with any other vowel. I do not think it can truly 
be said to be a sound of any consistent form of the English language. 
It is rather what may be called an “unstable” sound. Every word 
in which it is used has alternative pronunciations with one of the 
stable English sounds ou or a, and I do not think any English 
speaker can be found who uses the sound in question consistently 
and to the exclusion of diphthongal ou in any particular phonetic 
setting. It is, I believe, always used in an erratic manner, that 
is to say, people sometimes use one pronunciation and sometimes 
another without adhering to any consistent principle. That such 
a way of speaking cannot be reduced to phonemes is explained in 
Chapter XXVIII. 

254. We can, as it seems to me, only assign this monophthongal 
o-like sound to the ou-phoneme if we take as the form of English 
to be recorded the pronunciation of an imaginary consistent speaker 
who always uses it to the exclusion of diphthongal ou in pretonic 
position or some other definite phonetic context. 13 

255. Broad transcription of such a normalized form of speaking 
would involve using ou to denote both the diphthong ou and the 
special monophthongal o-sound, with the convention that the latter 
sound is meant in the particular contexts referred to in the last 
paragraph. We should thus write mou'lest, ou'bei, prou'faund, 
nou'vemba, etc., if our imaginary speaker is considered to use 
the special monophthongal o-sound in such words. 

256. A case could, I think, be made out for regarding the mono¬ 
phthongal o-sound under discussion as a separate phoneme in the 
pronunciation of speakers (if there are any) who use it consistently 
in a particular set of words. To transcribe broadly the speech 
of such people would require that we should denote this sound by 

13 The pronunciation of such an imaginary speaker would be recognizable 
as perfectly “good” English. 
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the letter o and use a special symbol (a) in the representation of 
the ordinary short o of hot, long, etc., and the long o: of caught, 
fall, etc. Thus we should write mo'lest, o'bei, no'vemba, 
moules'teijn, hat, lar), ka:t, fail. This manner of transcribing has 
been followed in many published texts. 

257. A further case of the existence of a pure vowel and a 
diphthong in one phoneme, a case which if it can be established is 
a particularly striking one, is that of Pekingese u. In that language 
there is an u-phoneme, the sound of which is usually a rather 
open variety of monophthongal u. But when the phoneme is 
preceded by i and the syllable is said on the third or fourth tone, 
the sound used is diphthongal and sounds like au. The case may 
be exemplified by the words 'iu (sad), / iu (oil) which have mono¬ 
phthongal u, and /iau (have) and Nau (right hand) which have 
the above-mentioned diphthong. If this analysis is correct, these 
latter words would be written / iu, Nu in broad transcription. 
J. P. Bruce did in fact write them narrowly with au in the “Tone 
Practice” examples on p. 40 of his Linguaphone Chinese Course 1 *', 
in the following exercise and throughout the rest of the book he 
wrote them broadly with u. 15 


14 First edition. 

15 An observation of the pronunciation of two Pekingese speakers leads 
me to think that the very marked diphthong used in these third and fourth 
tone words should perhaps not be considered as belonging to the U-phoneme 
at all, but to the ou-phoneme, and that in consequence the above four words 
should be transcribed “iu, / iu, / iou, Nou. If this is the correct view, it means 
that the sequence iu never has the third or fourth tone, and the sequence 
iou never has the first or second tone. Karlgren, howevgr, appears to be 
in substantial agreement with Bruce. He says ( Phonologie Chinoise, p. 318) 
that with many speakers the "final” iu varies with the tone, and that many 
Pekingese pronounce iu on the first tone and a sound which he writes ieu 
on the other tones. It is worthy of note too that in the " Gwoyeu Romatzyh ” 
( / kuo / y / luo^ma s tSS) system of writing, in which the tones are shown by 
means of spellings, this syllable is written with ou or ow: iou, you, yeou and 
yow. Iou and you are pronounced “iu and / iu, while yeou and yow appear 
to me to be pronounced almost, if not quite, / iou and Nou. Similar con¬ 
siderations appear to apply to the syllable which appears in the first tone 
as “ui (second tone / ui, third tone / uei, fourth tone s uei). This point is 
noted by Karlgren ( Phonologic Chinoise, Vol. II, p. 732, footnote 2). 

Pekingese pronunciation being rather variable, it is probable that the 
degree of diphthongization of the "diphthongal u” is greater with some 
speakers than with others, and that consequently the sound has phonemic 
affinity to u in the speech of some and to ou in the speech of others. 
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Compound Sounds 

258. Phonemes comprising compound sounds (§9) are very 
common, and are found in all languages. When the sounds are 
of the second category (§§16-18) there are often a great many 
members. Such phonemes are the English t and k. Five members 
of the English t-phoneme were mentioned in §72. As for the 
k-phoneme, it has different members according to adjacent vowels 
(§21), and it has more or less aspirated members according to the 
degree of stress, less for instance in 'stokig (stocking) than in kit) 
(king), and still less after s, as in 'biskit (biscuit), and so on. 

259. When the sounds are of the first category (§16), there are 
not as a rule many members of the phoneme. The English tj- 
phoneme, for instance, has little in the way of subsidiary members 
beyond a variety somewhat less aspirated than the principal 
member, which occurs in weakly stressed positions. Thus there 
is a certain aspiration of tj in tjin (chin), less in 'hatJit (hatchet), 
and hardly any when another consonant follows, as in wotjt 
(watched). 

260. A difficulty of an unusual kind is encountered in connexion 
with certain compound consonants of colloquial Sinhalese. 18 This 
language contains two nasal consonant phonemes, m and n (the 
latter of which includes two subsidiary members q and rj 17 ); they 
occur in various phonetic contexts, and in particular they may 
precede b and d, <(_, g respectively. The language also contains 
four sounds which I write here as mb, nd, n«J, ng; they are regarded 
by Sinhalese speakers as single sounds, but analysis shows them 
to be of a compound nature—plosive consonants preceded by a 
very short nasal element. The nasal element is so short that the 

18 I refer here to the ordinary spoken language of the better educated 
inhabitants of Colombo, Ceylon. Formal Sinhalese differs from this in various 
respects; in particular, q and q are phonemes separate from n in formal 
Sinhalese, since in that style of speech q occurs before vowels and n occurs 
finally. See Perera and Jones, The Application of World Orthography to 
Sinhalese, in the Bulletin of the School of Oriental Studies, London, Vol. IX, 
Part 3, p. 706. Incidentally I have been informed by a Sinhalese that even 
the most highly educated Sinhalese find it difficult to speak for any length 
of time in the theoretically correct formal style. 

17 q is used when £ or c|_ follows. Q is used finally and when k, g or h 
follows. Ordinary n does not occur in these positions in the colloquial 
language. (It does occur finally in the formal language.) 
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word kande (trunk) has the same rhythmic pattern as kada (shoulder 
pole carrying weights at each end); it is not at all like the English 
word candour. 

261. A case of this nature raises the question whether one is 
at liberty to divide into two parts a sound which is regarded by 
native speakers as indivisible, and whether, if one does divide it, 
the parts can properly be assigned to other phonemes existing in 
the language. Thus in the present case is it admissible to divide 
fid into fi and d, and to assign the fi to the n-phoneme and the d 
to the d-phoneme? 

262. I am disposed to think that in making a classification 
into phonemes the view of the native speaker as to what constitutes 
a “single sound” should be regarded as important, though not 
necessarily conclusive. 18 If the Sinhalese people regard mb, fid, 
etc., as being indivisible, we should give their view serious considera¬ 
tion, even though it should conflict with the ideas we have derived 
from European languages; we should be open-minded enough at 
least to examine the possibility of regarding these sounds as con¬ 
stituting separate independent phonemes. My personal opinion 
is that they should be so regarded. 

263. This does not mean that we must necessarily avoid using 
such letters as m, n, b, d in the representation of these sounds in 
phonetic writing. If a Sinhalese were attempting to devise a 
romanic orthography for his language, his first impulse would be, 
I believe, to represent the above sounds either by specially designed 
letters or by b, d, g with a diacritic attached (e.g. 6, d, c[, g 19 ). 
The use of completely new letters has much to be said in its favour, 
but against it must be set (1) the difficulty of devising four new 
letters which are sufficiently simple and which harmonize properly 
with the other roman letters, (2) the fact that in the view of many 
it is preferable to keep the alphabet as small as practicable (see 
§684) . The use of b, d, cj_, g with diacritics attached is unsatisfactory 
for two reasons: (1) because all systems of diacritics are in themselves 


18 For instance, I have heard English people express the opinion that the 
ks of box is a "single sound.” Such a view cannot be maintained without 
regarding the ks of locks also as a "single sound.” And if ks. why not also 
ps, gz and many other consonant groups? 

18 Using ” in the sense of a slight nasality preceding the plosion. 
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undesirable, and should be avoided if possible, 20 (2) because there 
are typographical difficulties in putting a diacritic above tall roman 
letters like b and d. 

264. Fortunately it is easy to devise a system of phonetic writing 
which involves neither the introduction of new letters nor the use 
of diacritics. Let the sounds under discussion be denoted by the 
convenient digraphs mb, nd, ncj, ng, and let the sequences m + b, 
n + d, etc., which have a prolonged nasal consonant, be represented 
by prefixing another nasal consonant, thus mmb, nnd, etc. This 
mode of writing the latter sounds may be interpreted either as long 
m + b, long n + d, etc., or as m + mb, n + nd, etc. (since long 
m + b is indistinguishable in sound from m -f mb, and so on). 

265. The following are examples of Sinhalese words written 
phonetically by this system (which I will call System A): 

amba (mango) gemmba (frog) 

kanda (trunk) kannda (hill) 

hancja (sound) kanncjjja (mound, embankment) 

anga (horn) annga (limbs) 

266. Another possible system (System B) would be to use the 
diacritic ”, in which case the long nasal sounds would be represented 


by one letter only, thus: 


amba 

gemba 

kanda 

kanda 

hancja 

kancjjja 

anga 

anga 


267. I think this course is really less satisfactory than the other, 
firstly on account of the disadvantages inherent in diacritics, and 
secondly because mb is rather a cumbrous way of representing what 
native speakers regard as a single sound. The Sinhalese phonetician 
H. S. Perera preferred, however. System B. 

268. In Sinhalese, nasal consonants may precede consonants 
other than b, d, c[, g. In these cases they are fairly long and are 
naturally identified as m, n (q, g) by Sinhalese speakers. They 
therefore correspond to the sounds written mm, nn in System A. 
It would, however, not be necessary to write them double, since 
no confusion could ever arise if they were written with single 
m, n. Examples: ampitiga (name of a place), antima (end), 


20 See footnote 7 to §277 (p. 84). 
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karanfa* 1 (to do), anka 28 (numbers), maincu (fetters), sinhea 2 * (lion). 
If System A were adopted, this would be a case where phonetic 
consistency might well be sacrificed to practical convenience. If 
System B were adopted, no inconsistency would arise. 

269. The answer to the question propounded in §261 should 
then, in my opinion, be that the mb, fid, fig of colloquial Sinhalese 
are best regarded as separate single phonemes, but that it is con¬ 
venient to represent these phonemes by digraphs made up of letters 
representing other phonemes with or without the use of the 
diacritic ”. The case is analogous to that of diphthongs. 

Compound Sounds and Simple Sounds as Members of 
the same Phoneme 

270. The "stop” elements of plosives are "simple” sounds. 
Not unfrequently these occur as subsidiary members of plosive 
consonant phonemes. Thus the k-phoneme is represented by stop 
only in the common pronunciation of such English words as akt 
{act), 'lektja {lecture). 

271. A stop may also be a member of an affricate consonant 
phoneme, but apparently this only occurs in the first part of a 
doubling ("lengthening," §§360 ff.): e.g. in Italian 'fattja {faccia), 
'msddzo {mezzo), Hindi bacca (child), hay am (barber). 24 

272. An example of a simple consonant phoneme which has a 
compound consonant as a subsidiary member is found in Tamil. 
In that language the phoneme which it is convenient to denote 
by c has for its principal member a sound resembling the Polish e. 
This is the sound used initially before all vowels except i, e.g. in 
col (word). Intervocalically it is commonly voiced; thus acai 
(shake) could be written narrowly a?ai. But initially before i, 
and medially when doubled, an affricate c is used: examples, 
cinam (anger), peiccu (talk). For further details, see J. R. Firth’s 
Short Outline of Tamil Pronunciation, pp. xi, xii, from which the 
above examples are taken. 

11 Narrowly [karaq£a]. Also pronounced with (karanncja in System A). 

21 Narrowly [agka]. 

23 Narrowly [sighea]. 

24 When two affricates follow each other in English, the first is fully pro¬ 
nounced. The case only arises in compound words such as 'wotjtjein 
(watch-chain). Repeated affricates are occasionally found in Russian, and 
then too the first is fully pronounced; example: 'tjtjetnij (TmeTHbltt, vain). 
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CHAPTER XVI 

COLOURED VOWELS 

273. Sets of vowels can be pronounced which differ from common 
vowels by having one or more special ‘‘colourings.'’ The chief 
kinds of colourings are 

(1) nasalization, when vowels are said with simultaneous 
lowering of the soft palate, 

(2) r-colouring, when vowels are said with a quality reminiscent 
of a frictionless j, 

(3) breathiness, when vowels are ‘‘aitchified,” i.e. pronounced 
with “husky” or “breathy” voice, due to the emission of 
more air than is needed for ordinary vowels, 

(4) creakiness, when vowels are pronounced with "creaky” or 
“intermittent” voice produced by a pharyngal contraction 
somewhat like that used in forming the Arabic consonant S, 

(5) sulcalization, when vowels are said with a “throaty” voice- 
quality due to sulcal tongue position (raising the sides of 
the back part of the tongue and leaving a groove down the 
centre). 

All ordinary vowels can be said with any of the above modifications 
and with combinations of some of them. 

274. Vowels with the first four of the above colourings are 
not uncommon in languages, and the distinction between the 
coloured vowels and ordinary vowels are often significant. Thus, 
as is well known, nasalized vowels occur with significant function 
in French, Polish, Portuguese, Hindi, Ga and other languages, 
r-coloured vowels are found with significant function in various 
types of American and British English. Urdu and other languages 
of Northern India contain vowels said with breathy voice. So 
also do some languages of the Sudan. Vowels said with creaky 
voice occur in languages of the Sudan and (sometimes in conjunction 
with special tones) in Vietnamese and cognate languages. Sulcalized 
vowels are easy to make, 1 but I have not heard of any language 
in which they are used side by side with ordinary vowels for making 
significant distinctions. 

1 The old-fashioned Polish t (i.e. not the common modern sound which is a 
variety of w) is a sulcal sound. It resembles a sulcalized U. 
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275. The clearest method of representing coloured vowels 
phonetically is by means of digraphs, i.e. by denoting each coloured 
vowel by the letter for the ordinary vowel followed by a letter 
indicating the colouring. Thus nasalized vowels can almost always 
be denoted unambiguously by vowel letters followed by r), 2 r- 
coloured vowels by vowel letters followed by j, aitchified vowels 
by vowel letters followed by h, and so on. 

276. Deviations from the digraphic notation rnay be found 
convenient in particular cases. It is, for instance, convenient and 
economical to write the r-coloured a of ordinary American English 
simply with j instead of with the digraph aj: J0 [earth), bjd [bird], 
■stJiQ (stirring) 3 rather than aj0, bajd, 'stajiQ. And if there were 
a language containing a single nasalized vowel with a basic quality 
distinct from all the other vowels of the language, it would probably 
suffice to represent it either by rj alone or by an appropriate vowel 
letter without any indication of nasality. 

277. As coloured vowels are single sounds, it would be possible 
to represent them in writing by special letters or letters with 
diacritic marks. This has hitherto been the usual practice in regard 
to nasalized vowels: they have generally been written with " above 
the letter denoting the ordinary vowel, thus: a, 8. 4 Similar devices 
might be adopted for other coloured vowels. Thus letters like 

at, ? 6 or a diacritic J (a, a, etc.)’ may be used to denote r-coloured 
vowels, and diacritics such as superposed 4 and ’ to denote vowels 
with breathy voice and creaky voice. Except perhaps in the case of 


* Unambiguously as far as the particular language transcribed is concerned. 
According to J. R. Firth, Marathi is an exception in which nasalization 
must be denoted by a letter other than Q. See Jones, The Problem of a 
National Script for India, p. 14, footnote 3. 

5 Cp. striQ {string). 

4 Paul Passy, however, used the diagraphs eq, at), DQ, ceQ to denote the 
French nasalized vowels in the 1917 edition of his Premier Litre de Lecture. 
The same system is followed in the proposed new romanic orthography for 
Hindi; see Jones, The Problem of a National Script for India. 

Ordinary French spelling uses the digraph system (with n), and it works 
on the whole satisfactorily. It would be completely unambiguous if the 
letter for nasalization were rj instead of n. 

6 See Le Maitre Phonetique, Jan., 1947, pp. 16, 17. 

• J was used by R. J. Lloyd in his Northern English. 
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creaky voice I feel that the digraph system is the better mode of 
representation. 7 

278. The following are examples of words containing uncoloured 
vowels compared with similar words having vowels with various 
types of colouring: 

(1) nasalized vowels: French pa (pas), pag ( paon ), 8 mes (messe), 
megs (mince ): 8 Hindi hat (is), haig (are); 

(2) r-coloured vowels: American English 'mans (manna), 'manj 
(manner)-, South-Western British English 'fa:3j (father), 
'faj3j (farther)-, 

(3) vowels with breathy voice: Hindi ka (of), kah (grass), 
ja (go), jah (high rank), bair (revenge), baihr (sea), <|p[a 
(rope), <Johc a (wooden spoon), kora (new), kohra (fog), 
tera (curtain), tehra (threefold), jona (enchantment), tohna 
(feel); in the Abua language spoken at Ahoada, Southern 
Nigeria, doy (to be sweet), dohy (to speak a parable) 9 ; in 
Nuer karri (chair), kahm (insect) 10 ; 

(4) vowels with creaky voice: Vietnamese °be (basin), v be 
(broken), u zai (belt), v zai (saliva), u da (to knock), v da (sign 
of past tense). 11 

279. Vowels may have two colourings simultaneously. Examples 
are seen in the Hindi words mogh (mouth), sigh (hedgehog), 
kogh[a (sweet pumpkin), megh (rain), paoghena (to reach), in 
which the vowels are both nasalized and pronounced with breathy 
voice. 


7 Diacritic marks are known to be unsatisfactory from the point of view 
of legibility, handwriting and printing. See The Practical Orthography of 
African Languages, revised edition, pp. 4-0 (International African Institute). 
Letters with two diacritics are particularly objectionable. They would be 
needed if the diacritic system of indicating nasalization were applied to such 
a language as Panjabi which makes use of essential word-tones. See 
Grahame Bailey’s Panjabi Phonetic Reader. 

• More usually written pd, mt:s. 

• Example furnished to me by Professor Ida Ward. 

10 Example taken from A. N. Tucker’s description of Nuer in Westermann 
and Ward’s Practical Phonetics for Students of African Languages, p. 204, 
where a number of other instances are given. 

11 Examples furnished to me by Miss E. Henderson. u has been suggested 
by her to denote a rising tone accompanied by breathy voice, and v to denote 
a rising tone accompanied by a creaky voice. (See §587.) 
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280. Coloured vowels must, according to the principles suggested 
in this book, constitute phonemes separate from ordinary vowels 
occurring in similar phonetic contexts in the same language. Thus 
I think it necessary to regard the Hindi sounds u, uh, ug and ugh 
as four separate phonemes 12 


CHAPTER XVII 

VOWELS AND CONSONANTS AS MEMBERS OF 
THE SAME PHONEME 

281. A vowel and a consonant occasionally occur as members 
of the same phoneme. This is the case, for instance, with the 
French vowel y and the consonant y (more usually written q). 
In ordinary Parisian speech these sounds are not found in identical 
positions in words, y occurs (1) initially preceding a vowel, (2) 
after any consonant or consonant-group when i follows, (3) after 
single consonants when other vowels follow. It is used, for instance, 
in (1) yit ( huit ), (2) nyi ( nuit ), plyi (pluie), bryi {bruit), k5stryi:r 
{construire), (3) tys ( tuait ), nya :3 {nuage), nyfi:s {nuance), kwatjbir 
{quatuor). On the other hand the vowel y does not occur in these 
positions; it never occurs before i, and it only occurs before other 
vowels if two consonants precede, as in kryel {cruel), kryote 
{cruaute), iflydis {influence). 

282. It is to be noted that if word-groups were to be admitted 
into the criterion for establishing phonemes, y and y would have 
to be assigned to separate phonemes in French, on account of 

l * Some of those interested in the theory of phonemes have expressed the 
opinion that "colourings” can be regarded as phonemes, so that, for instance, 
a nasalized vowel may be looked upon as a group of two phonemes, viz. the 
vowel and the nasalization. It does not appear to me possible to maintain 
such a theory. One might as well treat as phonemes many other features 
of speech which are common to several sounds, such as voicing, dental and 
other articulations of consonants, lip-rounding of vowels. (One colleague 
has indeed suggested to me that the implosive character of the Sindhi voiced 
implosives should be regarded as a phoneme.) Such a view would involve 
an entirely different conception of the phoneme from that put forward in 
this book, and one which I think would have little practical value even if it 
could be worked out into a consistent system. 
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comparisons such as tya (tua) , tya (tu as). See §34, also Some 
Thoughts on the Phoneme, 1 p. 129. 

283. For reasons similar to those given in §281, u and w appear 
to belong to one phoneme in one type of French. In this type w 
occurs to the exclusion of u initially, as in wi ( oui ), west (ouest), 
wa ( oie ), and after single consonants, as in Iwe ( louer ), mwa ( moi ), 
but u is used to the exclusion of w after two consonants in such 
words as brust ( hrouette ), kiue ( clouer). 

284. This type of French is that of speakers who make no 
difference in sound between pairs of words like voua and voit, 
joua and joie (vwa, 3 wa). There are, however, many Frenchmen 
who distinguish them as vua, vwa, 311 a, 3 wa. With them w is a 
phoneme separate from u. The same is true of those who use the 
sequence ua in various proper names such as Touareg (pronouncing 
tuareg and not twareg ), 2 if it is considered impracticable to exclude 
proper names from the material required for phonemic analysis 
(§229). 

286. The case of French i and j is different. These sounds must 
be considered as belonging to separate phonemes on account of the 
occurrence of j in final position: compare trai ( trahi ), travaj (travail). 3 

286. It would seem that in Italian the gliding consonant j 
must belong to the same phoneme as the vowel i. The controlling 
principles are not entirely clear, and probably vary from one speaker 
to another, but it seems likely that with many Italians the phoneme 
appears as j when it is weakly stressed, is followed by a vowel and is 
not preceded by r or certain consonantal groups, such as kl. 
Examples where the sound is j: jeri ( ieri , yesterday), 'pajo ( paio , 
pair), 'sjamo ( siamo , we are), i'talja ( Italia , Italy), ita'ljano ( italiano , 
Italian). Examples where the sound appears to be usually syllabic i, 
in spite of the weak stress: oriblo ( oriolo , watch), 'pnprio ( proprio , 
one’s own), riuj'jita ( riuscita , success), kli'ente ( cliente , client), 

1 In the Transactions of the Philological Society, 1944. 

2 These speakers may be the same as those who distinguish voua from 
voit. The matter does not appear to have been investigated. 

3 It might be possible to make out a case for assigning i and j to the same 
phoneme in French on the ground of relations between the sound used and 
the incidence of the strong stress or the nature of the intonation. The con¬ 
ventions would, however, be difficult to formulate and inconvenient to work 
with in practice. 
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probably also 'grattsie ( grazie , thanks), diskussi'one ( discussione , 
discussion), inkwi'eto ( inquieto , anxious). Similar considerations 
probably apply to Italian w. 

287. It will be noticed that the use of i and u in ordinary Italian 
spelling conforms to this phonemic classification. 

288. It is possible that in some types of German too the con¬ 
sonant j should be held to be a member of the i-phoneme on similar 
grounds. Trubetzkoy was of this opinion. 4 He did not, however, 
give the evidence for his conclusion, and I am not at present able 
to produce conclusive arguments for or against the theory. 

289. Certain details of transcription depend upon what is 
considered to be the phonemic status of j in such a language as 
Italian. If j is held to belong to the i-phoneme, and is therefore 
written broadly with the letter i, some indication must be given 
that it is weakly stressed as compared with the vowel following it. 
The transcription ita'liano, for instance, would be ambiguous: it 
might mean [ita'ljaino] or the non-existent [ita'liiano]. Probably 
the best system is to retain the letter j; another plan would be to 
write T in the relevant cases. In words like discussione, where 
usage is variable or doubtful, the position of the stress-mark would 
be a matter for consideration: one might write either diskussi'one 
or diskus'sjone. 

290. Sounds so different as i and z occasionally occur as members 
of the same phoneme. This is found in Cantonese and other types 
of Chinese, where the i-phoneme following s has the value of a kind 
of z. Examples from Cantonese are "si [narrowly ~sz] (four), 
-tsi [ - tsz] (most), S tshi [ N tshz] (later). 

CHAPTER XVIII 

MANNER OF USE AS A CRITERION 

291. Sometimes the manner of using a sound has to be taken 
into account in settling its phonetic status. A particular manner 
of use may on occasion render it necessary to class a sound as a 
separate phoneme, or to group it phonemically with another sound 
having the same manner of use, and to disconnect it from the same 
sound when used in a different manner. In other terms, identity 


4 Grundziige der Phonologie, p. 64, footnote (in Cantineau’s translation, 
p. 74). 
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in manner of use occasionally forms a stronger bond than phonic 

identity. 

292. This appears to be the case with the Japanese syllabic 
rp, n and g . 1 These syllabic sounds occur in Japanese only before 
homorganic consonants. Thus, syllabic rp occurs only before p, 
b and non-syllabic m; examples: sampo (walking), sirpbug (news¬ 
paper), semmog (speciality). Syllabic g occurs only before k and 
non-syllabic T), and exceptionally before -g in the word -go (five) 
(which is always treated phonetically as a separate word); examples: 
degki (electric power), sarjgai (third storey), oggaku (music), sag-go 
(three times five). Syllabic n occurs only before consonants other 
than the above, including non-syllabic n; examples: sendo: (boat¬ 
man), kandjo: (account), konnag (confusion), onna (woman). Non- 
syllabic m, n, g occur only before vowels, as in nomu (drink), 
[kago] (basket). 2 

293. It is natural to a Japanese to regard the syllabic m, n, g 
as being all “the same sound’’—that their syllabic use (which 
involves a certain length and a lack of “off-glide”) connects them 
with each other, and that they have, to a Japanese, more resemblance 
to each other than they have to the non-syllabic m, n, g (which 
are short and pronounced with off-glide). A Japanese does not 
apparently look upon the syllabic rp of sarnpo as having any 
particular relation to the non-syllabic m of say samasu (cool); he 
considers it to be “the same sound” as the n of sendo: and the g 
of degki. 

294. He also finds it natural to identify these three syllabic 
sounds with their final nasal sound g, which is heard in such words 
as sag (three), keg (prefecture). 3 Syllabic rp is to him “a kind 

1 A " syllabic ” consonant is one that is used in the way that vowels generally 
are. It is made prominent (§§434, 435) enough to be reckoned in counting 
the number of syllables in a word or sentence. In most situations it gets 
its syllabic character by being pronounced with greater length than non- 
syllabic consonants. The Japanese syllabic consonants occupy approximately 
the same length of time as ordinary syllables which consist of a non-syllabic 
consonant plus a vowel. 

2 Non-syllabic g may be a member of the g-phoneme in some forms of 
Japanese (see §122). It is not used by all speakers; some use g instead, and 
pronounce the word for "basket” as kago. 

3 The sound here written with g is a peculiar nasal sound which appears 
to vary between g and y, and which involves partial or sometimes complete 
nasalization of the preceding vowel. It is written (adequately) with n in 
romanic orthography. 
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of n,” and not “a kind of m”; syllabic n is likewise to him “a kind 
of r]” (not “a kind of n”), and syllabic g is to him “a kind of n” 
(not "a kind of g or g” nor a separate phoneme). 4 * 

295. Two phonemic classifications of the Japanese nasal sounds 
are therefore possible. The first, which seems more natural from 
the European standpoint, is to regard syllabic rp, n, g as members 
of the m, n, g phonemes respectively, and to regard g as a separate 
phoneme. The second, which for some not easily definable reason 
appears more natural to a Japanese, is to regard g together with 
syllabic gi, n and g as constituting one phoneme, and the non- 
syllabic m, n, g as belonging to phonemes separate from this. 
Presumably one should accept in preference the Japanese view¬ 
point. A broad transcription of Japanese on this basis would 
therefore require that we should write sagpo, sigbug, segmog, 
seqdo:, etc. For symbol economy n may be substituted for g; this 
gives the spelling used in the Kunreisiki Romazi orthography. 

296. The expression “related in character” (§§31-33) has thus 
to be taken occasionally to include "related in manner of use." 

297. Manner of use must, I think, be advanced as the chief 
reason for regarding the Parisian French a (“e mute”) as a phoneme 
separate from oe, if indeed a can be considered to be a phoneme 
at all in that language. 6 The function of a in Parisian speech is 
a peculiar one. The sound appears to be merely a vocalic noise 
which speakers insert here and there in order to prevent the 
occurrence of certain sequences of consonants alien to their manner 
of speaking. It is immaterial to the sense of all ordinary words 
whether an a is inserted at the place appropriate to it, or whether 
it is omitted. This vowel is thus used in a different way from 
other vowels. If a word said in isolation contains a, that a is often 


4 His identification of these sounds receives a certain support from the fact 
that words compounded with a word ending in -g have a pronunciation with 
gi, g or g according to circumstances: compare sag (three), saqipondo (three 
pounds), sagibeg (three times), sanpbjaku (three hundred), sando (three 
times), sagkiro (three kilos). This consideration is, however, not conclusive, 
since on,e must always make allowance for the possibility that the formation 

of compound words may involve a change of phoneme. 

6 The remarks in this and the following paragraphs do not apply to types 
of French in which all final " e mutes” are sounded as a, e.g. where bout and 
boue are distinguished as bu and bua. They probably apply to types of French 
in which bout and boue are distinguished by length (bu, bui). 
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omitted in connected speech, generally in accordance with rather 
complicated principles. 6 Moreover, the conditions under which 
a is inserted or omitted differ in different parts of France and 
French-speaking Switzerland. 

298. Proper names form exceptions. In them the sound a is 
more stable than in ordinary words, and it seems to be this fact 
alone that causes a to constitute a "speech-sound” of Parisian 
French. If it were not for the proper names, I doubt if it would be 
possible to regard the a of Parisian French as a phoneme any more 
than we can regard the common sound ? as a phoneme of ordinary 
English. 7 The ? of ordinary English and the a of ordinary words of 
Parisian French are so to speak accidental sounds. The ? of English 
is a sign of word separation, or is prefixed to vowels to give them 
emphasis; its presence or absence cannot affect the meaning of a 
word. The presence or absence of a in ordinary Parisian French 
words is largely determined by the sound of neighbouring words 
in connected speech, and is not essential for the intelligibility of 
the word containing it. In proper names, however, its presence 
may be necessary to intelligibility. I do not know of any pair 
of words differing solely by the presence or absence of a, 8 but a 
case like kartare ( Carteret) and ekartre ( ecarterait) or port re ( portrait, 
porterait) suffices to show that such a distinction is a possible one. 9 

299. The quality of the noise which we are here representing 
by a is intermediate between the French vowels 0 and ce. Its 
precise value varies somewhat from speaker to speaker and to 

* These are formulated in various books on the phonetics of French. See 
for instance Armstrong’s Phonetics of French, Chapter XX. 

7 I refer here to the types of English usually described in books on phonetics. 
There are other types in which ? is an essential sound, "replacing" the t 
of 'litl (little), etc.; in these types it either constitutes a phoneme or is a 
member of the t-phoneme. 

8 Unless we take into account the exceptional word daoir (dehors). It 
may, however, be held that this word contains a kind of h, or that its first 
vowel is a variety of 0 or of ce. The case of SSTE (serein, serene) and sre 
(serin, stupid person) is probably also inadmissible, since S3t£ belongs to 
formal speech and sre to the colloquial language. 

8 The proof afforded by this name is not absolutely conclusive. The sound 
of kartare is hardly to be distinguished from kartcere, so that it might 
reasonably be maintained that the sound used in this word is oe. I think 
it likely, however, that other proper names containing an undoubted 3 exist 
which would demonstrate the proposition. 
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some extent according to phonetic context; it is, for instance, 
nearer to oe after r (r) as in pramje [premier), otramd ( autrement), 
and nearer to 0 in various other combinations, e.g. in bazwg ( besoin ), 
nav 0 (neveu). The difference between a and 0 may be considered 
to be significant, though the conditions under which comparable 
contexts are found are always of a rather special nature: either 
the contexts are in word-groups, which strictly speaking do not 
count for the purposes of phonemic classification , 10 e.g. 33 ku:r 
(je cours), 30 ku:r [jeu court), or the distinction is helped out by 
the greater length of the 0 , e.g. 3 ane (genSt), 30 ns ( jeunait ). The 
difference between a and oe is also supposed to be significant, but 
it is difficult to find examples, and such examples as there are are 
not conclusive, sale (celait) and soele (seulet) are supposed to be 
pronounced differently, but the difference when any is made is 
slight, and it is certain that these rare words would be recognized 
in sentences by the verbal context and not by the difference of 
vowel. Other examples may be found in word-groups, which, as 
already mentioned, do not provide good evidence for phonemic 
grouping (§34). navo [neveu) is distinct from noev 0 [neuf ceufs) 
and la rtu:r [le retour) is supposed to differ in sound from loer tu:r 
[leur tour). 11 

300. The French sound a is thus too near both to 0 and to oe 
to be capable of effecting significant word distinctions by ordinary 
interchange with them. In other terms, if a were looked upon as an 
ordinary vowel, it could not be considered as a phoneme separate 
from both of these; it would have to be attached to one of them— 
presumably to oe. In view of the special circumstances, and in 
particular of the facts that ( 1 ) a is functionally different from 
ordinary vowels, and (2) though a resembles oe in sound it cannot 
be grouped into the same phoneme as oe on account of its particular 
manner of use (§297), I think we are justified in calling it a phoneme 
for the reason given in §298. 

301. It has been suggested that the syllabic use of certain 
consonants in English and in colloquial German is a phonemic 
phenomenon, and that such “syllabic use” should be held to belong 

10 §§3l, 34. 

11 This latter example is quoted from Passy (Les Sons du Franfais, 7th 
and subsequent editions, §254). Whether most Parisian speakers make any 
difference between these two sequences is doubtful. 
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to the same phoneme as short a. Such a view may perhaps be 
upheld if a phoneme is taken to be an ideal or abstract sound 
(§20 and Chap. XXIX). On this hypothesis one may say that the 
abstract a is manifested concretely as a in such words as 'brekfast 
{breakfast), 'bulak [bullock], 'me0ad [method), but that its concrete 
manifestation takes the form of “syllabic use” in words like 'mAtn 
[mutton), 'ped| [pedal). 12 Stated otherwise, the speaker “thinks” 
'mAtan and 'pedal but utters 'mAtn and 'ped|. But I do not 
think the syllabic manner of using such consonants can be said 
to belong to a phoneme if the phoneme is defined in a physical 
way as in §31. Objectively, words like 'mAtn, 'pedj merely contain 
n’s and I’s which are somewhat longer than most non-syllabic 
n’s and I’s and which receive intonations in the same way that 
vowels do. 13 


CHAPTER XIX 

THE OVERLAPPING OF PHONEMES 

302. There may be rare cases where two phonemes contain 
a particular sound as a subsidiary member; the phonemes would 
be said in such a case to “overlap” or to “intersect.” Two types 
of phonemic overlapping may be possible. The first would be 
when the common sound belongs to one phoneme in one context 
and to the other phoneme in a different context. The second 
type is when a sound occurring in one particular context may be 
assigned to either of two phonemes. 

Overlapping in Different Contexts 

303. When two phonemes comprise members differing con¬ 
siderably from each other, it may happen that a member of one 
of them used in one context is near to or possibly identical with a 
member of the other used in some other context. 

304. This type of phonemic overlapping can generally be 
illustrated by means of diagrams. It will be recalled, for instance, 

12 The syllabic mark is generally omitted in simple transcriptions. 

13 Trubetzkoy and others have regarded such syllabic consonants as 
" realizations of the phoneme sequences an, al ” ( Grundziige der Phonologie, 
p. 56, Cantineau’s translation, p. 64). See also G. L. Trager’s article on 
The Transcription of English in Le Maitre Phonetique, Jan., 1935, p. 12. 
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that the various members of a vowel phoneme may often be 
visualized as lying within an “area” in a two-dimensional vowel 
figure, as shown, for example, in Figure 2. With most languages 
the "areas” corresponding to each vowel phoneme are fairly small 
and are separated from each other by relatively wide spaces. Thus 
the vowel phonemes of Spanish lie in small “areas” like those shown 
in Figure 7, which indicates that no member of the a-phoneme is 
near to any member of the e-phoneme, and so on. Where, however, 
the areas are large, it may happen that they overlap. 



a 

Fig. 7. Areas of the Spanish Vowel Phonemes. 

305. Such overlapping may be observed in the a and oe phonemes 
of many Northern French people. The “areas’- covered by the 
phonemes used by such speakers are approximately as shown in 
Figure 8. The backest member of the a-phoneme is the one used 
before r followed by another consonant, as in part ( porte ). A fairly 
back one is used before r only, as in ka:r (corps). Fronter members 
occur in other situations, e.g. in am (homtne), ban (bonne), etaf 
(etoffe), klaj (cloche). 1 Retracted varieties of oe are heard when r 
precedes or follows, as in oert (heurte), koe:r (caeur), 'roefeir 2 (re/aire), 
while fronter varieties occur in such words as 3 cen (jeune), cef (ceuf), 
voel (veulent). 

306. It will be seen from Figure 8 that the frontest French 
a resembles the backest oe—that the vowel sounds in oert and 


1 The frontness of the a in such words is the cause of considerable difficulty 
in the acquisition of the pronunciation of such words by English learners. 

1 Emphatic form of ra'feir. 
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kbj, for instance, are very near to each other. The two areas 
overlap, and a detailed examination of all the distinguishable 
variants in each phoneme might well reveal at least one sound 
common to both of them . 3 (See also §309.) 



Fig. 8 . Overlapping of the French 3 and oe Phonemes. 


307. With some French speakers there would appear to be 
overlapping of the e and e phonemes. (See §215.) The shades of 
vowel sound used have certain relations to the neighbouring sounds 
in the words. In particular it seems that closer varieties are used 
by many before s and t, especially when these are followed in turn 
by e or i, and that opener varieties are used before r. Thus the e 
of presi ( precis) is closer than that of ero (heros), and the e of 
Ekski ( exquis ) is closer than those in SErvo ( cerveau) and Eroeir 
(erreur). Consequently there is little difference between the closest 
variety of e and the openest variety of e. In fact it is probable 
that in the pronunciation of many, these sounds axe identical, 
or that the closest e is even higher than the openest e, e.g. that the 

3 It does not appear possible to connect the forms 3 cel i and apsoely, which 
are so often heard in the North of France as alternatives to 33 ! i and apssly, 
with phonemic classification or with the overlapping of the French 3 and oe 
phonemes. If, as is generally assumed, 3 celi and apsoely ^are modern pro¬ 
nunciations of joli and absolu replacing older forms with 3, they are merely 
examples, of the replacement of one phoneme by another in the course of 
historical development (see §710). Why this change has been made is not 
clear. Vowel harmony has been suggested as an explanation, but this theory 
is not borne out by the pronunciation of other words which usually have 3 
in analogous positions, e.g. .pssibl (possible), psli ( poli ), 3fis (office), msdifje 
( modifier ). 
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e of ekski may be identical with or even closer than the e of ero. 
(See further, §312.) 

308. In Russian there is probably a certain overlapping of the 
e and a phonemes. Both these phonemes cover considerable 
"areas.” The e-phoneme has members ranging from about cardinal 
e (between two soft consonants, as in pjetj (nen,, to sing)) to a 
very open and somewhat retracted sound well below cardinal £ 
(initially or preceded by J or 3 and followed by a hard consonant, 
as in 'etat (otot, this), ^ertva (ntepTBa, victim) ). The a-phoneme 
has members ranging on the outside of the vowel figure from a 
raised front a which might be denoted narrowly by a* or ae (between 
soft consonants as in pjatj (naTt, five)) and a very back a (preceded 
by a hard consonant and followed by I, as in 'palka (najina, stick)). 
In addition there are a-like variants used in weakly stressed positions. 
There is thus in all probability an overlapping of the type shown in 
Figure 9. 




e |e«j in pjec|-^- 

e [e»] in njet— 


e [e-] in 

a [ae] in pjatj n||||\* ' 



Fig. 9. Overlapping of the Russian e and a Phonemes. 


309. Overlapping of the type described in the preceding para¬ 
graphs does not interfere with phonemic grouping. The grouping 
presents no difficulty so long as the necessary degree of acoustic 
separation between phonemes occupying neighbouring “areas” is 
maintained in each phonetic context. In French, for instance, d 
and ce are clearly distinguished in any given context: ko:r (corps) 
is distinct from koe:r (occur), and the o’s in vol (vol), etof (etoffe) 
are equally distinct from the oe’s in voel (veulent), oef («/). 
Similarly, in the pronunciation of those referred to in §307, the 
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low “e moyen” and the low “e ouvert” are distinct from each 
other in ero (keros) and croeir (erreur), while the higher “e moyen” 
and the higher "t ouvert” are equally distinct from each other in 
presi ( precis ) and prcse ( firesser) and in ebesil ( imbecile ) and 
VEStjeir ( vestiaire ). So too in Russian the distinction between the 
vowels in pjetj and pjatj is a very wide one, and the distinction 
between those in 'etat and sat (caa, garden) is equally so. The 
similarity between the French sounds in oert and kbj or between 
the Russian e of 'etat and the a of pjatj is immaterial to the phonemic 
analysis. 

310. Overlapping of the type we have been considering is 
uncommon in consonants, and I am at present unable to quote 
any certain examples. It is, however, possible that overlappings 
of q and k might be found in languages containing q and iu. In 
such a language it would not be surprising to find that the q of the 
syllable qi had about the same articulation as the k of the syllable ktu. 

311. Another possible example concerns the overlapping of 
English t and r phonemes. There are in England many who 
pronounce such words as Saturday, beautiful with a voiced alveolar 
flap [r], thus ['saradi, 'bjuifafl]. I have not had an opportunity 
of examining in detail the speech of any of these, but I have the 
impression that this [f] is generally distinct from the r-sound they 
would use in a similar situation, namely, [j] as in ['kaji] (carry), 
and distinct also from their d in 'lads (ladder). If this is so, their 
[r] is evidently assignable to their t-phoneme. There are also 
English speakers who use a rather similar [f] after 9 and 9, i.e. 
in such words as [0ri:] (three), ['bs:9rait] (birthright), ['breSfin] 
(brethren). This [r] cannot be held to belong to the t-phoneme, 
because the sequence 0t is a possible one 4 * ; nor can it be attached 
to the d-phoneme, since 0d and 3d are also possible sequences. 6 
The sound can therefore only belong to the r-phoneme, since the 
[j] of other contexts is excluded from use after 9 and 9. 6 If, as 

4 It is uncommon, but it is found in ba:9t (berthed) and a:0t ( earthed ) 

and in proper names like 'hi:9ton ( Heathton), 'smiBtan (Smithton). 

6 They occur, for instance, in 'ba:9dei (birthday), beidd (bathed), wiQ'dro: 
(withdraw). 

6 I refer here to those whose normal r-sound is [j], and who use this in such 
words as [!kaji] (carry), ['veji] (very). What is said here does not apply 
to those who use [f] in these words. 
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is possible, there are speakers who combine the two above-mentioned 
forms of pronunciation, their t and r phonemes would overlap in 
the sound r. 7 

312. The fact that the same sounds or two very similar sounds 
may occur in different phonetic contexts where there is overlapping 
does not mean that they should be written in phonetic transcription 
with the same letter. On the contrary two separate letters should 
be used to distinguish the phonemes even if the sounds are barely 
distinguishable. The values to be attached to the letters in phonetic 
transcriptions are not absolute but must always be interpreted in 
the light of the phonetic context. Thus it may happen that in 
particular cases in the transcription of French the letter £ may have 
to denote a sound very similar to or even closer than that represented 
by e under other conditions. (See §307.) 

313. It is possible to have overlapping of members within a 
phoneme. Thus the l-phoneme of Southern English is usually 
represented as consisting of two important members, a “clear” I 
and a "dark” (velarized) I, which are used in the contexts described 
in §71. There exist, however, shades of these. When a clear I 
precedes a back vowel, the variety used is less clear than when a 
front vowel follows. And similarly a dark I following a front 
vowel is less velarized than a final syllabic I or a dark I following 
a back vowel. Thus with many speakers the I in lo: [law) is much 
less clear than that in li:f [leaf), while the dark I in bild [build) 


1 My attention was first called to the possibility of the overlapping of t 
and r phonemes by Bernard Bloch's article on Phonemic Overlapping in 
American Speech, December, 1941, p. 278. At the time when Bloch wrote 
the article he thought he had found instances of this overlapping in a certain 
type of Mid-Western American English. He wrote to me, however, in 
September, 1942, saying that he had found it necessary to change his view, 
and that he considers the two f’s in cases such as those above-mentioned 
not to be identical. And as I too have not actually found anyone whose 
speech exhibits the combination of the two features above referred to, it 
cannot be claimed that the example is a convincing one, though there remains 
the possibility that the case may be found to exist. 

Bloch also told me that he had come to the conclusion that “overlapping' 
of phonemes is everywhere and under aU conditions inadmissible.” It must 
be conceded that it is difficult, and perhaps even impossible, to prove that 
two sounds occurring in different phonetic contexts are “identical.” Never¬ 
theless, in the present state of our knowledge I hardly think it possible yet 
to maintain such a generalization with complete confidence. 
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is less velarized than those in 'dAbl ( double) and sku:l (school). 
Hence with some there is an overlapping, by which the clear I 
in lo: is about identical with the dark I in bild. This is the case in 
my pronunciation. And I think it likely that some Southern 
English speakers could be found whose I in bild was definitely less 
velarized than their I in lo:. 

Overlapping in the Same Context 
314. When a sound occurs in a certain phonetic context to 
the exclusion of two nearly related sounds, it is possible, though 
not necessarily suitable, to assign it to the phoneme of one of the 
related sounds in some words and to that of the other related sound 
in other words. If this course is adopted, the two phonemes 
overlap, and the sound in question is shared between them. There 
is some doubt as to whether this procedure is ever justifiable. The 
question is discussed in §§321 ff., and examples are given there of 
cases in which “di-phonemic” treatment of a sound might possibly 
be resorted to. 


CHAPTER XX 

DI-PHONEMIC SOUNDS 

315. Where there is no “overlapping” of the first type, as 
described in §§303-311, it must I think be taken as axiomatic 
that in all ordinary circumstances a given sound cannot belong 
to more than one phoneme of a language. 1 Very rarely, however, 

1 I have heard it suggested by one phonologist that the phoneme should 
be so defined as to allow Q to belong to the n-phoneme in the English word 
il]k, though admitting it to be a separate phoneme in such a word as sil]. 
The admission of such a possibility would, in my view, render the whole 
phoneme theory unworkable. For if r) can belong to the n-phoneme in 
words like igk, but not in words like sil], then m might with equal right 
be classed as a member of the n-phoneme in words like lamp, though not 
in words like ham. Or we might, with equal right, reverse matters and 
assign the m of lamp and the n of end to the I] -phoneme. Moreover, on 
the same principle, we might regard S as belonging to the Z-phoneme in 
Stand and in boks (box), on the ground that Z cannot occur in such situations. 
It might likewise be maintained in French that the g of ansgdot ( anecdote ) 
should be assigned to the k-phoneme, on the ground that k does not exist 
before g in the speech of those who pronounce in this way. And so forth. 

I believe it would not be feasible to invent a definition of a phoneme which 
would allow for such possibilities. 
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instances may occur where a certain justification can be found for 
assigning a sound to two phonemes. These exceptional cases are 
where strict adherence to the ordinary principles of phonemic 
grouping would lead to inconvenience in morphology, or would 
obscure unnecessarily the relationships between words. 

316. If a sound is assigned to two phonemes, it may be termed 
"di-phonemic.” 

317. The cases of di-phonemic sounds fall under two heads: 
(1) when a sound occurs “incidentally” as well as "expressly,” 
and (2) when there are equally good reasons for assigning a sound 
to either of two phonemes. In both cases the assignment of a sound 
to two phonemes should not be made without the strongest possible 
morphological or other grounds. If great care is not exercised we 
may easily find ourselves faced with impossible proposals such as 
those mentioned in footnote 1 to §315. 

Sounds Occurring "Incidentally” as well as "Expressly” 

318. The nasalized vowels of some Indian languages should, 
I believe, be regarded as di-phonemic on the ground that they 
occur incidentally as well as expressly. The Hindi sound e is 
a case in point. In this language vowels are regularly nasalized 
when a nasal consonant precedes. Thus the agentive suffix 
commonly written ne is usually pronounced ng, though Indians 
are not as a rule aware of the fact; they think they pronounce 
ne, and the pronunciation ne would be accepted as correct. The 
nasalization is “incidental.” The sound g occurs also "expressly” 
as a phoneme separate from e. It generally occurs after non¬ 
nasal consonants, e.g. in pe (singing), gh5t (throat), cjegi (boat). 
But it sometimes occurs "expressly” after a nasal consonant; it 
does so, for instance, in the word mS (in), and in the case of this 
word a Hindi-speaking Indian will not accept me as a correct 
substitute. 

319. It therefore seems advisable in Hindi to assign the 
sound 6 to the e-phoneme in some words where a nasal consonant 
precedes, but not in others. The words in which this treatment 
is necessary can often be recognized by analogy (as when the sound 
denotes the suffix -e and not the suffix -e), or they may be ascertained 
by asking a Hindustani-speaking Indian to pronounce slowly, in 
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which case the incidental nasalization is with some speakers not 
complete. Essential nasalization is moreover shown in Indian 
writing. 2 

320. It seems that cases of a similar kind are to be found in 
various languages of West Africa. Thus, according to Wester- 
mann, the Ewe word ma (not) is indistinguishable in sound from 
ma (to divide); but in ma (not) the nasalization is incidental, 
whereas in ma (to divide) it is essential. In slow speaking the 
words would be distinguished. For this reason we may be justified 
in assigning the "incidentally nasalized” a to the a-phoneme. It 
must, however, be borne in mind that such a reason is only adduced 
for practical convenience; what is done in slow speech or any 
particular variety of speaking has in reality nothing to do with the 
phonemes of other styles of speaking. 

Sounds Assignable to either of Two Phonemes 

321. If a case were found in which there were equally good 
reasons for assigning a sound to either of two phonemes, an arbitrary 
classification would have to be made. There would be three possible 
solutions to the problem presented in such an event. (1) The 
sound might be assigned to one of the two phonemes chosen at 
random, (2) the sound might be treated as a separate phoneme 
independent of the other two phonemes, (3) the sound might be 
treated as di-phonemic, i.e. assigned to one phoneme in some words 
and the other phoneme in other words. 

322. The first solution would no doubt be the better one, even 
if no grounds could be found for preferring one of the phonemes 
to the other. The second should, I think, be dismissed as un¬ 
practical; it would involve introducing an unnecessary letter into 
broad transcriptions. The third plan could not be justified on 
phonetic grounds, but it might be found to have a certain practical 
convenience from the linguistic point of view. It would involve 
taking into consideration external facts unconnected with the idea 
of the phoneme. Such are the history of the sounds, the sounds 
of related dialects, analogy, grammatical relationships and perhaps 
even conventional writing. 


2 It is noteworthy that the Hindi word m a ( mother) has a recognized 
variant form ma with essential nasalization. 
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323. Some difficulties in phonemic classification, possibly 
involving di-phonemic treatment of some sounds of Pekingese, 
were suggested in §196. 

324. I had hoped also to be able to illustrate di-phonemicism 
by examples from Japanese. But on enquiry it seems that con¬ 
vincing examples are difficult or perhaps impossible to find, although 
the language presents several features suggesting that such treatment 
may be appropriate. The pronunciation, and therefore the pho¬ 
nemic system, has been undergoing a number of changes in recent 
years: I have been informed for instance that the syllable du 
[narrowly dzu] is no longer in general use, but has been replaced 
very commonly by the syllable zu. New sounds are coming into 
use owing to borrowing or tendencies to acclimatize foreign words 
(with their foreign sounds). Words which might at one time have 
served as examples have gone out of use. The Kana syllabic writing 
has been altered in some respects, so also have the newer systems of 
romanic writing. It looks as if such changes in the written and 
spoken languages are going to continue, so that anything one may 
say now about the phonetic or the phonemic systems of Japanese 
is likely to become out of date before long. One cannot illustrate 
phonemic phenomena in a language which is in such a changeable 
state. 

326. To illustrate possible di-phonemicism in language I will 
therefore give now a simple hypothetical example suggested by 
Japanese. Let us suppose that a language exists in which every 
syllable is composed of a consonant followed by one of five vowels 
[i, e, a, o, u], and that among the consonants are a [d], a [z] and an 
affricate [dz], and that these sounds are used according to the 
following system: 

[d] occurs solely before [i], [e], [a] and [o] (with almost imper¬ 
ceptible variants according to the vowel), 

[z] occurs solely before the same vowels (also with almost imper¬ 
ceptible variants according to the vowel), 

[dz] occurs solely before [u]. 

326. In this hypothetical language the [d] and [z] sounds ob¬ 
viously belong to separate phonemes, which we may call the 
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d-phoneme and the z-phoneme. As the sound [dz] occurs only in a 
phonetic context from which [d]-sounds and [z]-sounds are excluded, 
it does not “constitute” a phoneme separate from these, but it must 
be grouped either into the d-phoneme or into the z-phoneme, or 
perhaps di-phonemically into both. And as the nature of the 
sound [dz] can be such that there are no phonetic grounds why it 
should belong to one rather than to the other, we may find ourselves 
in the position of having to “allocate” it suitably. 

327. It is in this sort of case that it may be considered justifiable 
or even advisable to settle the allocation by taking into account non- 
phonetic criteria such as those suggested in §322. 

328. A historical possibility might be that at one period in the 
history of the hypothetical language the syllable [dzu] was non¬ 
existent, but that some of the words now containing it used to be 
pronounced with [zu] while others used to be pronounced with 
[du], A change of pronunciation subsequently took place by which 
the [zu] and [du] fell together and both came to be replaced by the 
analogous syllable [dzu]. Such a development might well leave a 
tradition (possibly supported by archaic writing) that [dz] had a 
relationship with [z] in some words and with [d] in others. This 
tradition might well be sufficient to suggest that the present day 
[dz] should be treated di-phonemically, i.e. assigned to the z- 
phoneme in some words and to the d-phoneme in others. 

329. I have been informed by a Japanese scholar that a develop¬ 
ment of this type is believed to have occurred in the Japanese 
language. 

329 (a). If there should exist present day dialects of the hypo¬ 
thetical language in which the syllable [dzu] does not occur, but 
the words containing [dzu] in the standard language fall into two 
categories, one set of words being said with [zu] in the corresponding 
words of those dialects, and the remainder having [du] in the same 
dialects, it would no doubt be advisable to treat the dzu of the 
standard language di-phonemically in a manner corresponding to 
the dialectal forms. 
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329 (b). It appears that examples of this type are to be found 
in Japanese. I have been informed, for instance, that the two 
words [midzu] of standard Japanese (meaning "water”and “not 
see” respectively) are distinguished as [midu] and [mizu] respectively 
in dialectal speech of Shikoku where the sound [dz] does not occur. 

(These two words also differ in tone: midzu [.-] (water) but midzu 
[•.] (not see).) 

329 (c). If there were to exist as cognates to words containing 
[dzu] words having [z] or [d] with vowels other than [u], it would 
no doubt be found advantageous to assign to the z-phoneme those 
[dz]-sounds which have cognates with [z], and to the d-phoneme 
those which have cognates with [d]. There would be no phoneti¬ 
cally valid reason for such allocations. They would, however, 
afford a means of distinguishing some of the homophones in a 
practical orthography for the language. 

329 (d). The following are examples of this procedure which 
may possibly apply to some Japanese. In [kadzu] (number ) 3 it 
is desirable to assign the [dz] to the z-phoneme, since the word is 
cognate with kazoeru (to count). On the other hand it might be 
considered advisable to assign the [dz] of the classical verb [idzuru] 
(go out) to the d-phoneme on account of its kinship with the corre¬ 
sponding colloquial deru. 

330. The following is probably an example from substandard 
Japanese which may be treated as a case of di-phonemicism. In 
this dialectal form of speech there exists a consonant of the [g]-type 
which occurs before [i] to the exclusion of the [h] of standard 
Tokyo pronunciation (which is a weak variety of [ 5 ]) and also to 
the exclusion of [s ]. 4 If anyone desired to make a phonemic trans¬ 
cription of this dialectal form of speech, this [ 0 ] might be assigned 


3 Referring to the speech of such Japanese as there may be who pronounce 
[kadzu] consistently with [dz] and do not use a variphone (Chap. XXVIII) 
comprising [z]. 

4 This pronunciation is common "in the popular speech of the KantO 
[Tokyo, etc.] region, and probably beyond it” (H. E. Palmer, Principles of 
Romanization, p. 85). It accounts for the difficulty that many Japanese have 
in distinguishing he from she when learning to speak English. 
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to the h-phoneme or to the s-phoneme, or alternatively for an 
external reason the unusual course might be followed of treating 
the sound di-phonemically, allocating it to the h-phoneme in some 
words and to the [s]-phoneme in others. 


331. The “external reason” would be that the speech of those 
who confuse [h] and [s]-sounds is not considered good by the educa¬ 
tion authorities, and the modern official writing distinguishes the 
two sounds by separate symbols. 

332. It has been suggested that the Russian sounds of the 
i-type might be treated as di-phonemic, belonging to the i-phoneme 
in words which have cognates with stressed i, or where ordinary 
spelling has h, and to the e-phoneme in words having cognates 
with stressed e or where ordinary spelling has e or a or h.- 6 The 
suggestion would involve assigning the varieties of i to the i-phoneme 
in such words as [zji'ma] (3niwa, winter), 6 [Iji'la] (jiujia, she poured),’ 
['Ijudjt]. (juoflu, people), [pjisj'mo] (inictMo, letter), but to the e- 
phoneme in such words as [dji'la] (gejia, affairs), 8 [sjimjt'na] 
(cesteHa, seeds),* [pji'ro] (nepo, pen), [I'motsija] (omouhh, emotion), 
[mji'dvjetj] (MeaBe^t, bear), [jij'tso] (niffio, egg). 

333. I feel it to be most unsatisfactory to resort to non-phonetic 
criteria such as derivation and spelling, and fortunately a detailed 


8 Reference is here made to the (apparently older) style of Russian in 
which weakly stressed H and e or a are pronounced with precisely the same 
variety of l in similar phonetic contexts, i.e. the style in which the t’s of 
[pjt'rok] (impor, pie) and pjl'ro (nepo, pen) -are identical. (It appears 
that at the present day a differentiation is generally made between the two 
classes of words, a variety of e and not l being employed in the words 
written with e, 9 and H. This style of speech does not illustrate the point 
under consideration.) 

8 Compare 'zjimt (SHMbl, winters). 

7 From Ijitj (JlHTb, to pour). 

8 From 'djela ['djels] (Aejio, thing). 

8 From 'sjemja ['sjemja] (ceMH, seed). 
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examination of the circumstances under which i and e and the 
related sounds are used in Russian leads to the conclusion that 
we can manage well without such criteria. We can, in fact, show 
that there is no reason to depart from the fundamental principle that 
a sound cannot belong to two phonemes, and we can make out a 
good case for assigning every l 10 to the i-phoneme. It is true that 
from some points of view it seems immaterial whether we assign 
i to the i or to the e phoneme, since it occurs to the exclusion 
of both the i and the e sounds in weakly stressed positions, and is 
probably about equidistant acoustically from the openest stressed i 
(as in bjil) and the closest stressed e (as in pjetj). On the other 
hand there is a circumstance which tells in favour of considering i 
as a member of the i-phoneme, namely, the existence of a sound t— 
a kind of a which is more i-like than the ordinary Russian a of such 
a word as ['doma] (jiOMa, at home). This sound i may be regarded 
either as a lowered i or as a retracted e; it would seem, however, to 
be definitely nearer to i. It is used in weakly stressed positions in 
surroundings similar to those in which ♦ is found in strongly stressed 
positions, that is to say, after hard consonants and J, 3 and ts. 
Examples are ['gort] (ropu, hills), [krt'lo] (Kpuno, wing), ['sontst] 
(cojiHue, sun), [ft'stoj] (uiecToii, sixth), [ft'pjetj] (uiniieTb, to hiss). 
1 is treated similarly in respect to i. It is used, to the exclusion of t, 
in weakly stressed positions in the same surroundings as those in 
which i occurs in strongly stressed positions, that is to say, initially 
and when preceded by soft consonants. Examples are those quoted 
in §332. These parallel relationships of t to i and 1 to i seem to 
me to constitute an adequate reason for classing the sounds phonem- 
ically in a similar manner, i should therefore be considered as 
belonging to the same phoneme as i, just as t falls into the same 
phoneme as t. And since t belongs to the i-phoneme (§97), so also 
must 1 and 1 . 

334. It will be observed that this manner of classification 
shows a symmetrical use of the vowels. The i, a and u phonemes 
occur under all conditions of stress, but the use of the e and o 


Win this paragraph the symbol l is to be taken to mean “the various 
Russian (.-sounds.” The shade of sound used depends upon the adjoining 
consonants: thus the t’s of [pjtsj'mo] and [mjl'dvjetj] are closer than that in 

[djt'la]. 
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phonemes is confined to strongly stressed syllables. I am disposed 
to regard this symmetry as additional evidence in favour of the 
above classification of i. 

335. The phonemic grouping of all these sounds, as I see it, 
is shown in Figure 3 (p. 27). 

336. If this is the correct way of classifying them, the broad 
(and digraphic) transcription of the words quoted in §332 would 
be as follows: zji'ma, Iji'la, 'ljudji, pjisj'mo, dji'la, sjrmji'na, pji'ro, 
i'motsija, mji'dvjetj, jij'tso. u 

337. Some have expressed the opinion that for similar reasons 
the English short a might be regarded as a member of two or even 
more of the English "strong vowel” phonemes, such as a and A. 
It has been said in favour of this course that short a only occurs 
with completely weak stress, but that such sounds as a and short 
a always have' either a strong or a medium stress. It has in fact 
been suggested that all the opener English short vowels overlap 
in the sound a. Personally I do not think that this view can be 
maintained, and that for three reasons. Firstly, as explained in 
§§197-202, it seems to me best to assign the ordinary Southern 
English short a to the same phoneme as the long a:. Secondly, as 
explained in §204, it does not appear to me admissible to postulate 
that such sounds as a and short a are incapable of receiving weak 
stress. And thirdly, even if that postulate could be shown to hold 
good and at the same time a reason were found for not grouping 
short a with a:, I think the sound should not be considered di- 
phonemic but should be assigned to the A-phoneme, since the 
principal a is nearer in sound to the principal a than to any other 
of the short "strong” vowels. 

338. The examples given in §§323-333 show then that when 
there are grounds for assigning a sound to either of two phonemes, 
it may perhaps be occasionally convenient to treat that sound as 


u It is noteworthy that if this classification of l is correct, Russian exhibits 
a treatment of words containing e similar to that found in English. Compare 
English di'fa: (defer) but 'defarans (deference), ris'ta: (restore) but resta'reijn 
(restoration), 'trAmpit (trumpet, from French trompette, now pronounced tr5p£t 
and doubtless also pronounced with E when introduced into English in the 
fourteenth century), 'ddlknis (darkness, formerly probably dEjk'nES, etc. 
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di-phonemic, but that di-phonemic grouping of such a sound is 
never a necessity. It is always possible to assign it arbitrarily to 
one of the phonemes, and as a rule this is probably the best course. 
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CHAPTER XXI 

FUNCTIONS OF LENGTH, STRESS AND VOICE-PITCH 

339. Every speech-sound (phone) has taraber, duration, stress 
and (if voiced) voice-pitch. These attributes are usually distinct 
from each other, but it must be remarked that in exceptional cases 
a tamber has to be held to contain a duration, stress or pitch element. 
The chief of these cases are: plosive consonants (where the plosion 
is “instantaneous”), semi-vowels such as w, j, y (q) (which have 
to be very short), voiceless b, d, etc. (which differ only in degree 
of force from unaspirated p, t, etc.), the Arabic S (which necessarily 
has a very low voice-pitch). 

340. In the preceding chapters we have been concerned mainly 
with tamber. It is now necessary to turn our attention to length, 
stress and voice-pitch and their functions, when these attributes 
are separable from tamber (as is usually the case). 

341. It will be found that groupings of lengths and voice- 
pitches can often be made which are comparable to the groupings 
of tambers into phonemes. But before entering into the theory 
of such grouping, attention must be called to an important difference 
of function between tambers and the other attributes. Differences 
of tamber are employed in language chiefly for the purpose of 
distinguishing "words” from one another. 1 In some languages 
degrees of length are likewise used for differentiating words; in 
some the kind of stress or the position of the strong stress (in words 
of more than one syllable) serves the same purpose; and in “tone 
languages” voice-pitches are similarly used. On the other hand 
there are languages in which length, stress and voice-pitch 
play no part in word-differentiation. These attributes have, 
however, other functions which tambers have not. They are made 
use of constantly and in all languages for giving emphasis to particular 
words in a sentence, for indicating contrasts, for reflecting emotional 
states of the speaker, or for suggesting other implications which 
cannot conveniently be expressed by means of words at all. These 
uses are exemplified in the following paragraphs (§342 ff.). 


1 Exceptions being differences between tambers found in the same phoneme. 
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342. Emphasis of particular words may be for “intensity” or 
for “contrast.” The following are some instances of the use of 
length, stress and voice-pitch for purposes of intensity. The meaning 
of the English word i'noimas ( enormous) can be “intensified,” so 
that the word denotes "particularly large,” by giving extra promin¬ 
ence to the already prominent second syllable; this extra prominence 
is effected by an increase of stress coupled with an increase in the 
length of the vowel 2 and the use of a special intonation. Splendid 
can be similarly treated, except that it is the consonant n which is 
lengthened in the emphatic pronunciation. 3 The meaning of 
absolutely (in the sense of ‘very much,’ ‘completely’) may be 
intensified by putting an additional stress, coupled generally with 
a high pitch, on the third syllable ( ll abs3 ,, lu:tli). So also in German 
(e.g. "faibalhaft with a very long a:, "vundarbair or ll vund3r ,l ba:r 
with lengthened n), and in Russian (e.g. udji M vjitjiljna with the i of 
the third syllable longer than usual and the two preceding syllables 
on higher pitch than usual 4 ). 

343. Stress may be used to express intensity in languages where 
it is not employed for word-distinctions. A noteworthy case is 
that of French, where intensity, is shown by putting a special 
strong stress on the first or second syllable of various long words 
(e.g. "fbrmi'dabl, "partikyljerma, a"bDmi l nabl or "abDmi'nabl, 
ap"SDlyma or "apsolyma). In some French words intensification 
may be effected by a stress coupled with a lengthening of an initial 
consonant (e.g. "mialce'ra). 

344. In emphasis for intensity the special attributes are applied 
to the particular words requiring intensification. Contrast, on the 
other hand, generally involves a special type of intonation spread 
over the contrasted word and all the following words in the sentence. 
Its nature depends largely on whether the contrast is with what 
has preceded or with what may be presumed to follow. 

345. The following are some examples from English. 

(1) Contrast with what precedes 5 * : 


2 Also sometimes of the n preceding the vowel. 

3 Not the vowel. The I preceding the vowel is also sometimes lengthened. 

4 yflHBHTHJlhHO ( wonderful ); intonation [‘ * \ . . ]. 

3 Intonations are represented here and elsewhere by the line and dot 

system. Thick lines and dots indicate the stressed syllables. 
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(bat) !'ai wont ta bai it [ \ . . . .] 

(bat) M ai dount wont ta bai it [\ ] 

(bat) ai "wont ta bai it [-\...] 

(bat) ai 'dount "wont ta bai it [ • ’ \ . J 

(bat) ai wont ta "bai it [. • • \ _ ] 

(bat) ai 'dount wont ta "bai it [ ■ *. ■ \ _ ] 

(bat) it 'woznt in"tenjanl 
(bat) it s "main [ -\] 

(bat) it 'sartnli iznt "main 

(2) Contrast with what follows: 

"ai dount wont ta bai it (bat . . . .) . • •' ] 

ai 'dount "wont ta bai it (bat . . .) [. * _ • • ] 

ai 'dount wont ta "bai it (bat . . . .) 

ai "wont ta bai it (bat . . . .) [ • 

it 'woznt in"tenjanl (.) [ • * • y\ ,] 

it 'iznt "main (.) [•*.<'\ / ] 

346 . The following are similar examples from French: 

( 1 ) Contrast with what precedes: 

(me) "mwa 3 emre I aj'te [ \ ] or [*.•■.] 

(me) "mwa 33 n vo 'pa [ \ •••] °r [*••.] 

(me) 3 e a"vi d I aj'te [ • • \.. ] 

(me) 3a n e paz a"vi d I aj'te [.. • • \.. ] 

(me) 3a vo 'bji I aj"te [. * * • \] 

(me) 3a n vo 'pa I aj"te [. • ’ • \] 

(me) 33 n I e 'pa fe eks"pre [.•**. \] 

(me) it et a "mwa [.. • \ ] 

(me) i n e 'paz a "mwa [. • • • \ ] 

(o k 5 tre:r) il e tu"3u:r 5 rta:r [. • ■ \. . ] 

( 2 ) Implied contrast in context of situation: 

(tjg), i fe 'bo o3urdyi [ • • \ ... ] 

( 3 ) Contrast with what follows: 

"mwa 3a n vo pa I aj'te, (me . . . .) [ \ _. • • * ] 
33 n e 'pa a"vi d I aj'te, (me . . . .) [ • • * • \ • • ] 
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In many other cases the intonation is the same as when the contrast 
is with what precedes, e.g. 

33 n Y0 ‘pa I aj"te, (me . . . .) [. • \] 

347. A great number of special meanings may be given to 
sentences by means of intonation. A notable case is when a sentence 
in statement form is pronounced so as to mean a question requiring 
the answer “yes” or “no.” Examples: 

Statement Question 

English 

ju 3 'kould [ -\ ] (you are cold) [ •'] (= are you cold?) 

ju: 'sed sou [ • \ . ] ( you said so) [ . / ] (= did you say so?) 

French 

i 'vjen [ \ ] or [ - \] (they are [ , • ] (are they coming?) 
coming) (ils viennent) 

German 

er ist 'erjlender [. -\ .. ] (he is [..*"] (is he English?) 
English) (er ist Englander) 

Russian 

xara'Jo [ * \j (it is right) [. • / ] (is it right?) 

(xopomo) 

kra'sjiva [' v . ] (it is pretty) [ • / . ]* (is it pretty?) 

(npacHBo) 

•xoladna [ s .. ] (it is cold) [ / .. ]• (is it cold?) 

(xOJIOflHO) 

vi tarn 'bilji [ ’ ‘ N . ] (you have [ • • / . ] 6 (have you been 
been there) (bh TaM 6 hjih) there?) 

Hindi 

thik hai [ ‘, ] (it is right) [. • ] or [ .\] (is it right ?) 

348. The numerous other meanings that can be suggested by 
intonation in non-tone languages have been described in various 

• With the intonations [•_■], [ • * ], ['•_'] these expressions would 
suggest surprise and doubt: “do you mean to say you think it’s pretty?”; 
“do you mean to say it’s cold?”; “do you mean to say you have been 
there?” 
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books and articles, and readers are referred to them for further 
examples. 7 

349. Languages do not all employ the same type of intonation 
for suggesting a given implication. And conversely a particular 
intonation may suggest one implication in one language and a 
different one in another. 

350. I have noticed for instance that some Russians when 
speaking English continually give (to the English hearer) the 
impression of impatience or querulousness or protest when in 
reality they have no such sentiments. The effect is produced 
by a frequent use of pitches much higher than those an English 
person normally uses. If an Englishman says ai 'wontid tu 
iks'plein it ta ju ( I wanted to explain it to you), he would normally 
pronounce the expression with an intonation of the type 

\ ...]. Some Russians, however, while saying the 
sentence with a somewhat similar tone-pattern, will use much 
higher pitches on wont and plein; they adopt in fact an intonation 
which an Englishman might employ when speaking impatiently. 8 

351. Another instance is seen in the above-mentioned intonation 
of the type [.\J used by many Hindi-speaking Indians to express 
a question requiring the answer “yes” or "no,” as in jhik hai 
[. \ ] (is it right ?). This is markedly different from the type [ • • / ] 
or [, •' ] commonly used for this purpose in such languages as 
English and French. In English the intonation [. \ ] is used to 


7 Armstrong and Ward, Handbook of English Intonation ; Palmer, English 
Intonation-, Chapter on Intonation in Jones’ Outline of English Phonetics, 
eighth and subsequent editions; Klinghardt und Klemm, Obungen im 
Englischen Tonfall', Coustenoble and Armstrong, Studies in French 
Intonation-, Klinghardt und Olbrich, Franzdsische Intonations-Obungen- 
Klinghardt, Obungen in Deutschem Tonfall-, Boy anus, Russian Pronuncia¬ 
tion, Chapter XII and texts; Boyanus, Spoken Russian R. Kingdon, 
The Groundwork of English Intonation, also his article Tonetic Stress-marks 
for English in Le Maitre Phonetique, October, 1939, and articles on intonation 
by him in English Language Teaching, Vol. II, Nos. 4, 5, etc. 

8 Compare the Russian i 'on 'dumal Jta 'eta at'kritjja [ • - \ . • • ' \ ] 

with the English hi: ' 0 o:t it W3Z 3 dis'kAVSri .\ ]. A Russian 

man often uses pitches such as F or G above middle C, or even higher, on the 
high stressed syllables of ordinary sentences. An Englishman rarely raises 
his voice as high as middle C in ordinary conversation, unless he is showing 
emotion or suggesting some special implication. 
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indicate astonishment, e.g. 'riali [.\ ]. In Welsh the same intona¬ 
tion is used in the termination of ordinary statements; it does not 
suggest any particular implication: a Welshman will say 'nevar 
r*\J where an Englishman would say 'neva [\.]. 

352. The methods of suggesting implications in tone languages 
are naturally different from those in use in non-tone languages. 
In Tswana, for instance, a statement cannot be converted into a 
question by applying a rising intonation; it is done by shortening 
the vowel of the penultimate syllable of the sentence. Thus when 
o batla nama is a statement (he is looking for some meat) it is 
pronounced o batla na:ma, B the vowel of the syllable na being 
long; when it is a question (is he looking for some meat?), the 
pronunciation is o batla nama, 10 the vowel of na being short. 
Similarly, phaixe 11 means “(it is) a cat," but phaxt 12 with short a 
means “Is it a cat?” 

353. I have been informed by Professor Ida Ward that in 
Igbo and other languages of West Africa notable alterations in 
tone-patterns have to be made in order to convert a statement 
into a question. But here again the alterations are quite different 
from those prevalent in non-tone languages such as English and 
French. Example: Igbo api ga £je [.•..•] (now—1965—spelt anyt 
ga eje, we shall go), [....•] (shall we go?). 13 And L. E. Armstrong 
has noted a number of remarkable interrogative tone-patterns in 
Kikuyu. 14 

354. In Cantonese, where the tones are so numerous as to 
restrict considerably the possibility of making words prominent by 
pitch, emphasis of particular words is effected mainly by length. 
The syllables to be emphasized are made longer than adjoining 
syllables, the length being spread over the vowel and final consonants 


* With more precise tone-marking: 5 batla na:ma. The low-falling tone 
(marked C) and the lowest level tone (marked _) are members of the ordinary 
low toneme (left unmarked). 

10 With more precise tone-marking: 5 batla nama. 

11 With more precise tone-marking: phaiXE. 

11 With more precise tone-marking: phaxg. 

18 From Ward, Introduction to the Ibo Language, p. 120. 

11 See her Phonetic and Tonal Structure of Kikuyu, pp. 72, 75, and elsewhere. 
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(if any) of the syllables; syllables of little importance are said very 
short. 15 

355. But even in tone languages there is such a thing as sentence- 
intonation which can be used for suggesting implications or reflecting 
emotional states of the speaker. The essential relationships between 
the word tones are as a rule preserved, but certain types of sentence- 
intonation can be superimposed on the tonal patterns of sentences 
to suggest the special meanings. 

356. The foregoing examples (in §342 ff.) illustrate the uses 
of length, stress and voice-pitch when these attributes are not 
“significant” in the usual linguistic sense of the term, i.e. when 
they are not used for distinguishing one “word” from another. 
In the following four chapters we shall examine the manner of 
using these attributes when employed as word-distinguishers. 
When so used, they are called by some writers “prosodic” elements 
of speech, as distinguished from the phonemic or qualitative 
elements. 


CHAPTER XXII 

SIGNIFICANT LENGTH 

357. Speech-sounds require time for their utterance; in other 
words, they have “duration," or “quantity” as it is often called. 
Some sounds are by their nature of limited duration, as already 
mentioned in §339. In particular the plosions of plosive consonants 
and the flaps of flapped consonants are necessarily extremely 
short; sounds of a gliding type are susceptible of a certain degree 
of lengthening but they cannot be held on indefinitely. Other 
sounds, namely those termed "continuants,” can if desired be 
held on as long as the speaker’s breath lasts. 

358. In connected speech the durations of continuant sounds 
and of the "stops” of plosive consonants vary greatly, and their 
differences of duration are attributable to a variety of circumstances. 
There are, for instance, quick speakers and slow speakers. Those 

15 Examples of this method of effecting word-prominence will be found 
by referring to the Cantonese Phonetic Reader by Jones and Woo. In many 
of the texts in that book the most prominent syllables are shown in black 
type and the least prominent in italics. 
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who talk fast use shorter sounds than those who talk slowly. There 
are also “drawlers”—people who are in the habit of making many 
of their continuant sounds (especially vowels) unusually long. 
Again the same speaker does not always pronounce words at the 
same rate, so that the sounds he uses in a given sequence are not 
always of the same length. 

359. Then there are conventional usages as to duration in 
particular languages. These usages vary from language to language. 
There may be (1) duration used as a means of distinguishing one 
word from another, (2) variations of duration according to phonetic 
context, (3) special uses of length in the sentence, or for purposes 
of emphasis. It is with the first of these that we are chiefly con¬ 
cerned in this chapter. The second is dealt with in the chapter 
on “chronemes” (Chap. XXIII). The third has already been 
referred to in §§342, 352, 354, and further mention is made of it in 
§§383-385. 

Length and Doubling 

360. As we are here concerned with “true” or “indivisible” 
length and not with “doubling” or repetitions of sounds, and as 
the acoustic difference between a “long” sound and a “doubled” 
sound is often difficult to perceive, it is necessary to preface this 
account of duration by some remarks on the difference between 
these. By “true” or “indivisible” length is meant a “long” 
duration which is not felt by the speaker as a repetition of a sound, 
and which cannot be replaced by doubling. In most languages 
lengthenings are found which have to be considered disyllabic 
and which may be pronounced disyllabically (i.e. as repeated or 
“double” sounds); they are or may be divided into two parts 
by a diminution of intensity in the middle. Such are the double 
short i of such English words as 'emptiii] {emptying), 1 beibiij 
{babyish), the doubled vowels in the French reeliir ( reelire ), saara 
{Sahara), koopere {cooperer) and the double vowels in Tswana 
words such as lubana (young pigeon), lubana (squall), maabapane 
(evening). 1 For examples of double consonants, see §§365 ff. 


1 In these Tswana examples the two vowels have the same tone. Other 
examples of repeated vowels occur very frequently in this language with 
two vowels having different tones. When the tones differ, the disyllabic 
character of the sequence is always very clear. Instances are jaaka (like), 
jaanoQ (now). 
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361. It is probable that the distinction between single and 
double sounds is always potentially significant. The distinction 
between short sounds and sounds pronounced with true length is, 
on the other hand, not of necessity significant, though in many 
languages it happens to be so. 

362. Although the difference between doubled vowels and 
single long vowels may often be inappreciable to the ear, and may 
be even objectively non-existent to listeners, yet the difference 
would seem to be in most cases subjectively perceptible to the 
speaker. Long vowels which are subjectively double may always 
be pronounced double, but with true long vowels a pronunciation 
with double sounds is not a possible alternative: thus it is not 
possible to pronounce the French me:tr ( maUre) as me-Etr or the 
German ba:n ( Bahn) as ba-an. 

363. Whether a vowel heard as long is to be considered double 
is a matter to be decided by the feeling of the speaker, which in its 
turn is determined by reference to the structure of the language. 
If a given long vowel is separable into two, and if therefore in 
precise speech it could be pronounced double, it is convenient to 
regard it always as double. If, however, a certain long vowel is 
never separable into two, and cannot be replaced by a sequence 
of two in precise speech, then the case must be considered to be 
one of true length. The distinction between a lengthened vowel 
and a doubled vowel has not yet, as far as I know, been found to 
be significant in any language; the difference is probably insufficient 
for this. 

364. If the difference between lengthening and doubling is 
difficult to perceive in vowels, it is still more so in the case of 
consonants. I cannot imagine that the difference between a long 
consonant and a doubled one could ever be significant. 

365. In practice it is convenient to regard all intervocalic 
long consonants as double, on the ground that it is usually possible 
in precise speech to separate them into two by a diminution of 
force in the middle, attaching the first part to the first syllable and 
the second part to the second syllable. That long consonants 
are divisible in this way is obvious in the case of compound words 
and words formed with prefixes or suffixes, 2 as in the following 

2 These are the only cases where long consonants are found in English, 
French, German and Russian. 
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examples: English 'bukkeis (book-case), 'pennaif (pen-knife), 'wAnnis 
(oneness), 'djenjuinnis (genuineness), 'souili (solely )-, 3 French 
dozjEmma (deuxiemement), anette (honnitete) ; German fer'raist 
(verreist ),'appaken ( abpacken ); Russian'strannij (eTpairauft, strange), 
atta'vo (oTToro, therefore), ad'datj (oTAaTt, to give away); Hindi 
banna (to be made), janna (to know). 4 So also in Arabic where 
cognate words often have a vowel between the two consonants, 
e.g. dikkarn (shop), the plural of which is daka:ki:n (shops), or the 
Syrian mille (religion) which has plural milal. 

366. When long consonants occur intervocalically with no such 
obvious derivations, it matters little whether they are regarded as 
double or merely as long. To me it seems preferable for practical 
purposes to regard them as double, one part being apportioned 
to each syllable. 6 The following are illustrations of the class of 
words referred to: Hindi, galla (flock), patta (leaf), rassi (rope), 
pilla (dog (male)) 6 ; Somali banzai (she searched). 7 

367. As to initial long consonants, they are best regarded as 
double whenever there is reason to consider the first part as a 
prefix or as an element of a compound. This is the case, for 
instance, in Russian words like 33 atj (cwaTb, reaped) 8 and in Tswana 
words such as mmetli (carpenter), mmila (road). The same 
applies no doubt to the initial long consonants of Luganda, as in 
tta (kill), ggwa (be finished),® which in inflected forms occur inter¬ 
vocalically and therefore presumably double. But in instances 


a Compare 'houli (holy). 

1 Compare bana (made), jana (knew). 

5 Some transcribers of Italian have used length-marks to indicate the long 
consonants of that language: see, for instance, Panconcelli-Calzia, Italiano. 
Others have repeated the consonant letter; see, for instance, Camilli, Italian 
Phonetic Reader.. Those accustomed to use the Swedish Dialect Alphabet 
show the long consonants of Swedish by a length-mark; these consonants 
might equally well be indicated by doubling the letter, at any rate in inter¬ 
vocalic position. Most transcribers of Hindi, Tamil, etc., show consonant 
length by doubling the letter, though the late Dr. Grahame Bailey used a 
length-mark for this purpose in his Panjabi Phonetic Reader. 

• Compar gala (throat), pata (address, sign), rasi (juicy), pda (cooked rice). 
7 Compare baic^ai (I searched) (Armstrong, Phonetic Structure of Somali, 

p. 138). 

• Compare 3atj (HtaTb, to reap). 

• Compare ta (let go), gwa (fall). 
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like the emphatic pronunciation of the French word M m:izerabl 
(tmiserable ) with a long m there is no reason to regard the sound as 
other than merely long. 

368. Final long consonants are conveniently regarded as double 
in all cases where there is reason to think that the native speaker 
feels them to be double, and more particularly when the latter 
part can be regarded as syllabic. It is thus clear that they should 
be regarded as double in such cases as the German 'k0nn, be'zinn 
(contracted forms of 'kanen, konnen, be'zinen, besinnen), since it 
is possible to pronounce them double; they are doubtless felt as 
double by German speakers even if objectively they sound merely 
long. The same applies in the case of compounds such as 'zonnjain 
(Sonnenschein). 

369. So also the final consonants of such Arabic words as 
dakk (sandhill), Ia 33 (to be sweet), darr (abundant), or the colloquial 
ha?? (truth), hubb (love) are doubtless felt by native speakers 
to be double rather than merely long on account of the existence 
of cognate forms such as dakaka, dikaik (sandhills), Ia 3 i :3 (sweet), 
midrairan (abundantly), ha?i:?a (truth), habi:b (beloved). 

370. In languages where no such special conditions have to 
be taken into account there is no reason to consider final long 
consonants as double. This applies not only to languages like 
English, French and Swedish where the lengths of final consonants 
are linked with the lengths of preceding vowels (§§548-553), but 
also to languages like Hungarian and Estonian where the lengths 
of final consonants are not so linked. 

371. Thus since the long final consonants used in English 
when short vowels precede, as in hil (hill), pen (pen), are never 
pronounced double, they need not be regarded other than merely 
long. And as length is invariably applied to final consonants 
when the preceding vowel is short it is not necessary to mark it 
in transcription. The same applies to the final consonants of 
French words like vil (ville), det (dette). In Swedish, although the 
principle is the same, the reverse notation is probably preferable 
on account of the frequent occurrence of long ("double”) con¬ 
sonants medially. The case is illustrated by the Swedish words 
kail (kail, cold), topp (topp, top), lamm (lamm, lamb), sill (sill, 
herring), which may be compared with lam [la:m] (lam, paralysed), 
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sil [si:l] (sil, sieve) which have long vowels and short final con¬ 
sonants. 10 

372. In Hungarian and Estonian differences of duration in 
final consonants are significant, and accordingly the lengths must 
be shown in transcription. The following are some examples from 
these languages with words ending in single consonants for 
comparison. (Although there is no reason to consider the long 
consonants as double, it is nevertheless advantageous from the 
standpoint of legibility to write them with doubled letters) 

Hungarian: hall [hall] {hall, hears) hal [hal] (hal, fish) 

lapp [lapp] ( lapp , Lap- lap [lap] (lap, sheet of 
lander) paper) 

've:gett ['veigett] 've:get ['veiget] (veget, 

(vegett, on account of) end, acc.) 
fo:tt (foil, cooked) f0:t (fot, chief thing, acc.) 

Estonian: si::tt (silt, from here) si::t (slid, silk) 

sa::kk ( saak , booty) sank ( saag , a saw) 

373. There are cases where a long consonant preceding or 
following another consonant in medial position has to be considered 
truly long and not double. They would, however, all appear to 
be cases where particular conditions apply, and which consequently 
do not illustrate any principles of general application. Such are 
the long nasals preceding voiced plosives in Sinhalese (see §260, 
264, etc.) and the long consonants in such Tamil words as maittru 
(change), arttam (meaning), tirppu (sentence). These Tamil 
consonants are not ordinary instances of doubling since in that 
language the length of medial p, t, etc., is associated with the 
presence or absence of voice. Short consonants in the above 
positions would be voiced; thus the word written broadly ontru 
(one) is pronounced [ondru]. It is possible that the native Tamil 
speaker does not think of the above tt, pp as being doubled, since 
the sounds may be considerably shortened, as long as their 
voicelessness is maintained. 11 

10 The phonemic system of Swedish is, however, obscure in several points, 
and I am not at present prepared to suggest a definite scheme for transcribing 
that language broadly. It might turn out that a broad transcription should 
show the lengths of vowels as well as of consonants. To ascertain this would 
require an investigation of the lengths of vowels in weakly stressed positions. 

11 See Firth’s Short Outline of Tamil Pronunciation, pp. iii, v. 
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374. In the case of various medial and final consonant groups 
in Swedish, the first consonant is long, e.g. the I in V vellja ( valja , 
choose), the n in 'innra (inre, interior), the k in 'akksal (axel, 
shoulder) or v akksal (Axel). These clearly have to be regarded 
as having true length and not as being double. (Transcribers 
using the Swedish Dialect Alphabet habitually write them with a 
length-mark.) 

375. It is to be understood then in what follows that the terms 
"long” and “length” are used only in reference to sounds having 
“true" or “indivisible” length as explained in the preceding para¬ 
graphs. Any lengths which can be regarded as doubling or repetition 
do not come within the scope of this enquiry. 

Word Differentiations by Duration 

376. As already mentioned in §§54 and 59, differences of duration 
are sometimes employed for the purpose of distinguishing one 
word from another. Some examples of differentiations by lengths 
of consonants were given in §372. Differentiations solely by 
lengths of vowels are exemplified in the following paragraphs 
(§§377-379). But where vowels are concerned, differentiations by 
length are very often accompanied by differences in some other 
attribute as well, generally tamber. Word distinction effected by 
duration in conjunction with other attributes is discussed in 
Chapter XXVI. 

377. Scottish English makes a limited use of distinctive length. 
In that type of English when inflected forms are constructed by 
the addition of -d or -z to a word ending in a vowel, the vowel is 
normally long. But in non-inflected words containing the same 
sounds the vowels are short. Thus, while brood, greed, road, freeze, 
use, rose have short vowels (broadly brud, grid, rod, friz, juz, roz), 
brewed, agreed, rowed, frees, ewes, rows have long ones (bruid, 
a'griid, ro:d, fri:z, ju:z, ro:z). 

378. Examples of foreign languages in which significant use is 
made of differences in the duration of vowels unaccompanied by 
perceptible differences of any other kind are Japanese, Finnish, 
Somali, Kikuyu, Luganda. Examples of words so differentiated are: 
Japanese isso (rather), isso: (more), soko (bottom), soko: (conduct) 
Finnish tule (come!), tule: (comes), tulle: (might come), tu:le: (it 
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blows), etc.; Somali kul (hot), ku:l (necklace), lab (male), la:b (chest); 
Kikuyu kora (grow up), ko:ra (root out), igoko (bark of tree), igo:ko 
(thick thatching grass), heni (lightning), herni (lies); Luganda okusona 
[.•••] (to sew), okusoina [.*-•] (to take by surprise), okutuma 
(to send), okutuima (to heap up), bagenda [ •.. ] 

(they go), baigenda (they went). 

379. French too makes a certain limited use of significant 
vowel length with little or no accompanying distinction of tamber. 
Examples are m£:tr (maitre), metr ( mettre ), bed (bele), bel {belle), 
tu:s (tons, all of them, us, etc.), tus (tousse). 12 

380. In some varieties of English it would appear that no 
difference is made between the sounds of final d and t, both being 
pronounced d. In the speech of those who pronounce in this way 
words like heed and heat (narrowly hi:^, hi^) are distinguished 
solely by the length of the vowel. H. J. Uldall, a reliable observer, 
considered this pronunciation to be common in America, 13 but the 
point does not seem to have received much attention from American 
phoneticians. 14 

381. Languages which make use of duration for word distinc¬ 
tions, with or without the assistance of other attributes, may be 
called “chrone languages” (see Chap. XXIII). In chrone languages 
there are as a rule only two significant degrees of duration, “short” 
and "long.” Estonian appears to be an exceptional language having 
three significant degrees of duration (see §§399, 419). 

382. Though the duration of vowels is a feature of speech 
which is easily capable of distinguishing words, there exist many 
languages in which it is not employed at all for this purpose. Such 
are Spanish, Russian, Polish, Greek, Persian, Bengali, Marathi, 
Tswana; also the type of English described in §519. 

12 These distinctions are not made by all French people. In the speech 
of some who distinguish long and short £ there is a slight accompanying 
difference of tamber. Some do not distinguish tUIS and tUS, but use a short 
vowel in both words. 

12 Le Maitre Phonttique, October, 1934, pp. 97, 98. 

11 H. A. Rositzke has produced kymographic tracings showing that the 
sounds of final d and t differ (Le Maitre Phonetique, October, 1938, pp. 64-66). 
He adds that " Uldall’s remarks should be taken to apply only to General 
American and Southern speakers, and even then not to all sections and 
speakers of the region.” 
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Semantic Length in the Sentence 

383. Languages exist in which differences in the duration of 
vowels distinguish one kind of sentence from another but not one 
word from another. The Tswana language of South Africa is 
one of these. In this language the vowels (or syllabic consonants) 
of penultimate syllables of complete statements are always very 
long. When the penultimate syllable of a complete sentence is 
short, it means that it is a question or a command or an exclamation. 
A statement can be converted into a question by shortening the 
vowel (or syllabic consonant) of the penultimate syllable. And in 
unfinished statements or non-final parts of sentences all syllables 
up to and including the penultimate are short, the ultimate being 
generally somewhat lengthened. 

384. The following are some examples illustrating the use of 
length in Tswana. They are given here with precise length and 
tone-marking. 16 

Long penultimate: 5 a: ja [ • x , ] (he is eating) 

5 a ja: <j>? [ ‘ • \ v ] (he is eating here) 

5 ja na:ma [' ' v . ] (he is eating meat) 

5 a baitla [ ‘ -x . ] (he is searching) 

5 a bat la: [ ’ • • x \ ] (he is searching here) 

5 batla phaixg [' • • \ . ] (he is looking for 
the cat) 

a batla kxo:m5 [' ■ ■ x • ] (he is looking for 
the cow) 

xo bo:na [• \ .] (to see) 
xo b5na:la [ •' v • ] (to be evident) 

Short penultimate: o a ja ['•_] (Is he eating?) 

5 a p <|>a [' • ’ . ] (Is he eating here ?) 

5 ja nama [''•_] (Is he eating meat ?) 

5 batla phaxg ['■•’.] (Is he looking for the 
cat?) 

5 batla kxomo [’••■'] (Is he looking for the 
cow?) 

15 The tone-marks are as follows: “ high-level, ^ high-fall, low-level, 

mid falling to low; unmarked syllables have mid-level tone. 
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£ ['] (Eat!) 

ja nama [' •. ] (Eat the meat!) 
batla [•'] (Search!) 

batla kxomo [•'■'] (Look for the cow!) 
nanoxa [ • ' . ] (Raise yourself!) 

385. In this language then length is applicable to a certain 
syllable of the sentence, the penultimate, and is not in any way 
attached to particular words or to particular syllables of any words. 
Batla (is searching), for example, may have the first vowel lengthened 
or the second vowel lengthened or may have both vowels short. 


Relationships Between Duration and Stress in 
Non-Stress Languages 

386. It is interesting to compare the case of Tswana with that 
of French, since neither language makes use of significant stress, 
but in both there are relationships between duration and stress. 
The usage in Tswana is as follows. There is strong stress on the 
two final syllables of every statement, except when these syllables 
bear the tone pattern [ N • ]; and, as already mentioned in §383, 
the penultimate syllable of a statement is always long and the 
ultimate always short. 18 Thus if the words phaxe (cat), nama 
(meat), kxomo (cow) occur at the end of a statement both syllables 
receive strong stress, and the vowel of the first syllable is long. 17 
In fact any syllable which in other positions in the sentence would be 
pronounced short has to be given length in the penultimate position 
in a statement. Usage in French is almost the opposite of this. 


18 The ultimate syllable is less objectively prominent than the penultimate 
owing to the shortness and frequent devoicing of its vowel, but strong stress 
is without doubt subjectively present and is shown objectively by the gestures 
(especially of the hands and head) which native speakers employ. 

17 With precise tone and length marking phaiXE [\.], naima [>,.], 
kxoimo [x - ]- The exceptional tone pattern which is associated with strong 
stress on the penultimate and weak stress on the ultimate is exemplified by 
khukhu (beetle), which at the end of a statement is pronounced khlllkhu 
['•]• 

When not final, Tswana words have other tone patterns, for which see 
Jones, The Tones of Sechuana Nouns (published by the International African 
Institute) and Jones and Plaatje, Sechuana Reader. 
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Final syllables in French sentences generally bear a strong stress, 
while (in the absence of special emphasis) non-final syllables have 
weaker stress. Final syllables may have long vowels, but when a 
syllable requiring a long vowel in final position occurs in pen¬ 
ultimate or other non-final situation, its vowel is short. Thus, 
when such words as fce:j (feuille), far ( faire) occur at the end of a 
sentence, their vowels are long; but in non-final positions, as in 
foej da lety (feuille de laitue), fer pair (faire part), their vowels 
are short. 


CHAPTER XXIII 

CHRONEMES 

387. In languages where differences of duration are employed 
for differentiating words, those differences are naturally con¬ 
siderable. But in such languages the “longs” are not always of 
the same duration, nor are the “shorts.” Durations are also 
variable in languages where length has no significant function. 

388. The chief factors associated with variations of duration 
are (1) the nature of the sound itself, (2) the nature of sounds 
adjoining it in the sequence, (3) the degree of stress, (4) the number 
of syllables intervening between one strong stress and the next, 
and occasionally (5) intonation. Illustrations of variations of 
length in accordance with these factors are given in the following 
paragraphs. 

389. (1) In English the close i and the back a, though both 
"long” vowels, are not as a rule of the same length in the same 
phonetic context; the a: is generally longer than the i:. It is easy 
to hear for example that the vowel in halt (heart) is longer than 
that in hi it (heat). 

390. (2) Variations in accordance with the nature of adjoining 
sounds are particularly frequent and noticeable in cases where there 
are associations of duration with quality (§§511 ff.); this can be 
readily understood, since when the differentiation between words 
is partly by tamber, the greater the tamber difference the less 
need is there for stability of length. Several of the vowels in my 
type of English illustrate such variations. Thus my vowels in 
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see, seed, seen, seat, seating, are all “long,” but they have not all 
by any means the same degree of length when said in isolation; 
and further degrees of length are found when the words occur in 
connected speech. 

391. It is easy to estimate roughly by ear the relative length 
of vowels in many cases of this sort, e.g. those in seed, seen, seat 
said in isolation. Accurate measurements of such lengths have 
also been made experimentally by many observers; see §403^06. 1 

392. 'The following is an example of degrees of length observable 
in a consonant. In Italian when t is preceded by a strongly stressed 
vowel + n, it is noticeably longer than a single t preceded by a 
strongly stressed vowel only. Thus the t in 'dante (Dante) is 
longer than that in 'date (give) 2 ; it is in fact almost as long as the 
long (“doubled”) t of 'fatto (made). 

393. (3) In stress languages vowels are usually shorter in non¬ 
final weakly stressed positions than in strongly stressed positions.® 
Thus in English the weakly stressed “long” o: in ko:'zeiJn ( causation ) 
is not so long as the strongly stressed vowel in ko:z (cause).* 
Similarly hi: (he) has a shorter vowel in hi: 'dxznt (he DOESN'T) 
than in 'hi: dxznt (HE doesn’t). 

394. Many examples of shortening in conjunction with weak 
stress are found in French. For instance, as already mentioned 
in §386, faire said by itself has a long vowel (fe:r), but in fair e-part, 
faire savoir, where the syllable is weakly stressed, the vowel is short 
(fer'pa:r, fer sa'vwa:r). 

395. (4) In stress languages there is usually a tendency to 
make the strong stresses follow each other at fairly equal intervals, 
whenever this can conveniently be done. This tendency produces 
the effect commonly termed "rhythm." It often determines the 


1 The first measurements of this kind were made. I believe, by E. A. Meyer, 
and were published in his Englische Lautdauer (1903). It was he who first 
pointed out that the vowel in ha:t can be heard to be longer than that in hilt. 

* Narrower transcription 'da:te. 

* Especially in weakly stressed positions immediately preceding a stress. 
Final weakly stressed vowels are often rather long (see the example city in 
§405). 

* Said in isolation or at the end of a sentence (see §395). 
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lengths of sounds. Thus if a number of weakly stressed syllables 
intervene between a strong stress and the next following strong 
stress in the sentence, various shortenings may take place in the 
unconscious endeavour to make the "stress bar” equal in length 
to other “stress bars." Thus, though the o: of the English word 
ko:z {cause) is very long when the word is said by itself, it is much 
shorter in such a sentence as 9a 'ko:z av it waz 'neva dis'kAvad 
[the cause of it was never discovered). It is for this reason that in 
such words as i'mkdjatli ( immediately ), in'dju:bitabl {indubitable) 
the long vowel is much less long than that of such words as mi:n 
{mean), dju: {due) said in isolation. (See my Outline of English 
Phonetics, §§886-890.) 

396. (5) Lengthening of sounds appears to be a usual accom¬ 
paniment of the English falling-rising intonation when this intona¬ 
tion is applied to a single syllable. Thus I should normally 
pronounce the trai in I'll try [-~] (implying "though I don’t 
suppose I shall succeed”) with a longer diphthong than in the plain 
statement I'll try [ - \ ]. Moreover there are tone languages in 
which special lengths accompany significant tones (see §§557-566). 

397. The lengths of sounds may be modified for purposes of 
emphasis. Emphatic lengthening must be regarded as belonging 
to a special form of the language. It is outside the scope of the 
present chapter. 

398. Any particular degree of duration may be termed a chron t. 
For instance, the duration of the vowel in see said in isolation in 
a normal way, is an example of a chrone. The durations of the 
vowels in seed and seat (which are different from this) are examples 
of other chrones. The relative values of different chrones may 
often be estimated roughly by ear. Their absolute values may be 
measured accurately, if desired, by apparatus in any convenient 
units of time, e.g. hundredths of a second. Thus in my normal 
pronunciation the chrone of see, said in isolation, has been found 
to measure on an average 6 0-317 sec., while the chrones of seed 
and seat have been found to measure on an average 5 0-252 and 
0-124 sec. respectively. 

399. As the actual lengths of sounds are often conditioned by 
phonetic contexts, the various durations (chrones) can be grouped 


• Average of four recordings. 
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together into what may be called chronemes, in the same sort of 
way as qualities (phones) may be grouped into phonemes. They 
differ from the latter, however, in that though there are many 
distinguishable chrones in most languages, there are seldom more 
than two chronemes. The actual lengths occurring in a particular 
chroneme may be called its “members” or “allochrones.” 

399a. Languages like Spanish and Russian where vowel duration 
is not “distinctive,” contain only one vowel-chroneme. The 
languages in which vowel-duration is distinctive (with or without 
accompanying quality differences, §§510 ff.) contain for the most 
part two chronemes only. This is doubtless because it is difficult 
for people with ordinary powers of perception to distinguish with 
certainty more than two degrees of length in a given phonetic 
context. The existence of three chronemes is, however, a possibility, 
and has in fact been stated to be found in Estonian, where three 
chronemes are applicable to both vowels and consonants (see §419); 
it is possible also that Japanese contains more than two vowel- 
chronemes. 

400. When there are two chronemes and they are not associated 
with quality-differences, there must naturally be a condition for 
intelligibility that the ratio between “long” and “short" should 
not fall short of a certain minimum.® Intelligibility is improved 
by increasing this ratio, as is no doubt often done. 

401. Examples of the grouping of absolute lengths into chronemes 
are given in the following paragraphs (§§402-417). §§402-416 
exemplify the principle in languages employing two chronemes; 
§417 contains illustrations from languages with only one chroneme. 

402. The type of Southern English which I speak is an example 
of a language where the differences of length included within the 
chroneme are particularly striking. In this type of English two 
chronemes are employed, a "long” and a "short.” Each of these 
comprises a number of chrones, some of which differ very con¬ 
siderably from each other. When the long chroneme is applied 
to the i-phoneme, a relatively close variety of i is used and the 
differences of chrone in different phonetic contexts are particularly 
noteworthy. In narrow transcriptions it is generally considered 
sufficient to indicate two of these chrones only, namely, "fully 


• What this minimum is has not, as far as I am aware, been determined. 
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long” and "half-long” 7 ; it is, however, not difficult to hear degrees 
of these, and it would theoretically be possible to mark them in 
very narrow transcriptions. 

403. The actual average lengths of the chrones used by me 
in the words mentioned in §390 have been measured and found 
to be as follows: si: 0-317 sec., si:d 0-252 sec., si:n 0-199 sec., 
si:t 0-124 sec., 'siitiQ 0-087 sec. 8 

404. The following have been found to be the lengths of my long 
a: in similar conditions 9 : ka: (car) 0-406 sec., ka:d (card) 0-338 sec., 
ka:m (calm) 0-305 sec., ka:t (cart) 0-254 sec., 'ka:tii] (carting) 0-187 
sec. It will be noticed that the chrones vary in somewhat the 
same way as those of i:, but that they are all a little longer and the 
ratios between them are somewhat less than the corresponding 
ratios for long i:. 

405. When the short chroneme is applied to the English i- 
phoneme, opener types of vowel are used, the variety employed 
in any given case depending upon the phonetic context and par¬ 
ticularly upon the st. ess. These opener sounds may be represented 
in narrow transcription by the letter i. Several chrones can be 
distinguished, but the differences between them are naturally less 
than between those found in the long chroneme. The actual average 
lengths of the chrones used by me in the words Lid, sin, sit, sitting, 
city were calculated in June, 1939, to be as follows: lid 0-135 sec., 10 
sin 0-077 sec., sit 0-085 sec., 'siti q (first i) 0-052 sec., 'siti (first i) 


7 And that only for the English pure or nearly pure vowels. It is not 
customary to mark the lengths of diphthongs, though they vary in the same 
manner as the lengths of pure vowels. (It is inconvenient to mark the lengths 
of diphthongs in writing.) 

8 These measurements and those in §§404-406 were calculated from kymo- 
graphic tracings by Dr. D. B. Fry and Mrs. H. J. Uldall in the phonetics 
laboratory at University College, London, in June, 1939. The figures show 
the average of four recordings of each word. 

Other kymographic tracings of similar words were published by me in the 
first and second editions of my Outline of English Phonetics (Leipzig, 1918, 
1922), p. 179, and in the Proceedings of the Royal Institution, Vol. XXII, 1919, 
pp. 8—18 (report of a lecture on Experimental Phonetics and its Utility to the 
Linguist delivered on 9th Feb., 1917). The measurements of the vowels there 
recorded were as follows: bi: (bee) 0-47 sec., bi:d (bead) 0-325 sec., bi:n (bean) 
0-304 sec., bilt (beat) 0-147 sec., bid (bid) 0-15 sec.,. bin (bin) 0-11 sec., bit (bit) 
0-068 sec. 

8 Including the aspiration of the k. 

18 See §421. 
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0-054 sec., (second i) 0-175 sec. 11 These measurements show relation¬ 
ships similar to those found in the long chroneme, except in sin 
where the vowel is somewhat shorter in proportion. It is not usual to 
indicate any of these differences of chrone in narrow transcriptions. 

406. Similar chrones are found in other cases of the short 
chroneme in my type of English, e.g. a and short o. Thus the 
average lengths of a in my pronunciation of thud., shun, shut, 
shutting have been found to be as follows: 0Ad 0-1405 sec., jAn 
0-110 sec., jAt 0-0928 sec., 'jAtig 0-065 sec. These measure¬ 
ments show relationships similar to those of long i. 12 

407. Stage German also has chrones applicable to vowels which 
can be grouped into two chronemes, a long and a short. There are 
accompanying quality relationships in the case of all the vowels 
except e. 13 Thus the following pairs of long and short vowels 
occur in German: i: i (narrowly u), e: e (narrowly a), 14 e: e, a: a 
(narrowly a), o: o (narrowly a), u: u (narrowly a), y: y (narrowly y), 
0 : 0 (narrowly ce). 

408. The chrones found within’these chronemes are in one respect 
similar to those occurring in corresponding situations in English. 
In German, as in English, the chrones used in unstressed pretonic 
syllables are notably shorter than those in comparable stressed 
syllables: the e: of ledbendig (lebendig) is shorter than the e: of 
■Ie:ben (Leben), just as the a: in the English ai'tistik ( artistic) is 
shorter than that in a:t {art). 

409. In other respects, however, the usage in German differs 
from that in English. In particular there is little difference between 
the chrones used in German before voiced and breathed consonants: 
the lengths of the vowels in 'bi:t3 (biete) and 'bi:na {Biene), 'Jtraisa 
(Strasse) and 'na:za (Nase) are not perceptibly different. 15 

11 It will be observed that final weakly stressed short i is longer than any of 
the others. It is, however, shorter than final weakly stressed long il, of such 
words as 'kouti: {coatee), ■pedigril {pedigree). 

12 The two chronemes are not applicable to all the English vowels. In 
my pronunciation they are not applicable to the a-sounds, and there is some 
doubt as to whether they can properly be applied to 3. (See §§197-202.) 

12 I refer to the type of German in which words like spat, ahnlich are said 
with a long E. 

14 The short e [3] only occurs in syllables with weak stress. 

16 A constant source of mispronunciation for English learners of German 
and German learners of English. 
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410. French makes a limited use of two chronemes. They are 
applicable only to certain vowels and to those only in certain 
situations. Moreover variations of usage are found with different 
French speakers. In one common type of French the two chronemes 
are applicable only to e before final p, t, m, n, I, s, J and j, to u 
before s and j, and to oe before j. Examples: gEip, djcp (guipe, 
Dieppe)-, 3£:m, 3£m ( j’aime, gemme); bel, bsl ( bSle, belle); abEis, 
ab£s ( abaisse, abbesse); tu:s, tus ( tous meaning “all of us,” “of 
them,” etc., tousse); nu:j, fenuj (nouille, fenouil); foe:j, rakcej ( feuille , 
recueil). 16 It is probable that the two chronemes are applicable to 
a, but there appear to be no conclusive examples to prove this; pairs 
like a:t (hale), drwat ( droite ) are perhaps a sufficient indication. 17 

411. The two chronemes are not applicable to the other French 
vowels, a is always short, and the remainder (i, e, a, d, o, y, a, 
£, a, 5 , oe) always have lengths determined by phonetic context. 18 
Nor are the two chronemes applicable to e, u and oe before con¬ 
sonants other than those mentioned in the preceding paragraph; 
before other consonants the vowels are long or short according to 
"rules.” 18 

412. In the situations where the two chronemes are applicable 
in French there is not much variation of chrone within each chroneme. 
Thus the long e’s are approximately of the same length in belt 
(bete), E:m (aime), be:I (bele), b£:s ( baisse ), fre:J ( fratche ); and the 
short e’s are approximately of equal length in bet ( belie ), 3£m (gemme), 
bel (belle), ses (cesse), 19 mej ( meche ). 19 

413. In the circumstances where the two chronemes are not 
applicable in French certain notable differences of chrone are to 


18 Some French speakers do not make these differences in the length of U 
and oe. With others, on the other hand, the two chronemes are applicable 
to i and y in some situations: fi:s, vis (fils, vis), omnibyiS, pys (omnibus, 
puce). With others again the two chronemes are not applicable to E before 
n, S and J, all the words ending in these sounds having e’s of equal length. 

17 On the assumption that the shortness of the a in drwat is not in any 
way conditioned by the presence of the rw. 

18 See the "rules of length” formulated in the books on the phonetics of 
French (e.g. Passy, Sons du Franfais, pp. 62, 63; Armstrong, Phonetics of 
French, pp. 151-153; Nicholson, Introduction to French Phonetics, pp. 175, 
176). 

19 The vowels in cesse and meche are lengthened by some. 
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be found within the single chroneme. The length of i, for instance, 
follows the well-known rule that the long chrone is used in final 
syllables terminating in v, vr, z, 3 , r or j, as in ri:v {rive) , i:vr (ivre), 
bi:z (bise), ti :3 (tige), fi:j {file) ; a short chrone is used in all other 
situations, e.g. in vi {vit, vie), 20 vit {vile), vis {vis) , viji ( vigne ), vil 
(mile), fig ( figue ), vid (1 vide). 21 

414. The distinctive long chrones of such words as ge:p, e:m, 
be:l (§410) are approximately equal to the non-distinctive long 
chrones in such words as re:v (rSve), Je:z (chaise), ne :3 (neige), pe:r 
(fere)] and the distinctive short chrones of djep, 3 em, bel, etc., are 
approximately equal to the non-distinctive short chrones of such 
words as Jef (chef), aseji (enseigne). 

415. French is thus a language in which the distinction between 
long and short chrones is sometimes significant and sometimes not. 
Another such language is colloquial Sinhalese. In that language 
all the vowels with the exception of a are subject to two chronemes: 
examples pi:li (gutters), pili (clothes), | 3 e:!a (half-day), pels (field), 
Eita (distant), eta (there is), ma:la (necklaces), mala (flower), po:ra 
(manure), pora (quarrels), su:du (gambling), sudu (white). In 
final position it is usually immaterial whether a vowel other than 
a is pronounced long or short: the word for "white” is pronounced 
indiscriminately as sudu or sudu:, and the words for “eye,” “small,” 
"more” are pronounced eht, poc|_i, pacja or the:, poc(_i:, pac(a: without 
any apparent system . 22 There are, however, a certain number of 
Sinhalese words in which the length of a final vowel is essential. 
Thus ridi: (silver), ne: (not), gona: (bull), ko: (where, interrog.) 
cannot have their vowels shortened. 

416. In final positions therefore two chronemes are applicable, 
( 1 ) a long chroneme and ( 2 ) what might perhaps be termed a 


20 Reference is here made to the type of French in which feminine endings 
do not involve length of vowel. In some types of French, vit and vie are 
distinguished as vi and vi:. 

21 With some speakers, final b, d, g are also associated with a long chrone 
on the preceding vowel. 

22 If there is any system governing the length of these final vowels it is 
probably to be found in the position of the word in the sentence, and in the 
nature of the sentence. This would be a phenomenon in length corresponding 
to sentence stress in stress languages and sentence intonation in tone languages. 
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“varichrone,” 23 the chrones of which vary from quite short to a 
duration equal to that of the long chroneme. There is in fact a 
kind of "overlapping” of chronemes here. 

417. In the preceding paragraphs we have dealt chiefly with 
length in languages where two chronemes are found. The grouping 
of chrones into a chroneme may, however, also be exemplified from 
languages having only one chroneme. In Russian, for instance, 
it is probable that, other things being equal, the vowels in syllables 
closed by consonants have a somewhat shorter chrone than the 
vowels of open syllables. Thus the vowel of rost (pocT, height) 
is probably a little shorter than the o in sto (cto, hundred). 24 In 
Spanish, according to Navarro TomAs, vowels have fairly long 
chrones when final (strongly or weakly stressed) and when strongly 
stressed and followed by a single consonant followed in turn by 
another vowel. Vowels in other situations (e.g. when strongly 
stressed but preceding two consonants or when weakly stressed 
and not final) have shorter chrones. Measurements are given on 
pp. 155, 156 of Navarro TomAs’s Pronunciation Espaiiola. 

418. From the linguistic point of view the differences of vowel 
length occurring in non-chrone languages are immaterial. One 
result is that length-marks need not be used in practical phonetic 
writing for such languages. Another is that (within limits) 25 though 
the use of incorrect vowel lengths by a foreign learner of such a 
language may cause him to speak with a foreign accent, it will not 
render him unintelligible. 

419. The use of three chronemes in a language is very rare. 
Three chronemes have, however, been stated to occur in Estonian. 
They are exemplified in jama {jama, nonsense), ja:ma ( jaama, of the 
station), ja::ma [jaama, to the station). Three degrees of consonant 
length are also believed to occur, and are exemplified in lina (flax, 


23 See “variphones” (Chap. XXVIII). 

24 Reference is here made to words said in isolation. In connected speech 
the lengths of words are often modified. It is, however, not possible to 
take account of variations of length in the sentence in order to arrive at a 
theory of chronemes, any more than it is possible to take into account varia¬ 
tions of sound occurring in connected speech in grouping sounds into phonemes 
(§34). 

26 I.e. provided that the lengths are not so much altered as to give prominence 
to syllables which should not be prominent. 
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sheet), iinna (of the town), linnna (to the town). 26 These and many 
other examples have been given by L. Krass in her thesis on The 
Phonetics of Estonian (1944) which may be consulted in the Library 
of the University of London. 27 

420. It is probable that in languages which make use of three 
chronemes there is not much variation of chrone within each 
chroneme . 28 Miss Krass has, however, called attention to some cases 
in which the extra long chroneme is not quite as long as usual . 29 

Overlapping of Chronemes 

421. It will be observed that in languages which make use of 
more than one chroneme the lengths belonging to the chronemes 
are liable to overlap a good deal. This is especially the case when 
differentiations are effected by duration in combination with 
tamber or some other attribute. Thus it will be seen from §§403 
and 405 that my short i in lid has a longer duration than my long i 
in si:t. And I think it probable that the German long a in weakly 
stressed pretonic position, as in da:'ne:ben (daneben), va:r'Jainli^ 
(wahrscheinlich) , would, be found to be at least as short as the 
stressed German short a in some positions . 30 It must be emphasized 
that the whole idea of the chroneme is based not on absolute lengths, 
but on relative lengths in similar phonetic contexts. 

Differences of Chrone according to Type of Sentence 

422. It will be seen from what was said in §§383-385 about 
length in Tswana that that language illustrates a very unusual 
type of chronemic usage. There is a chroneme of one kind for 
statements, and a chroneme of quite a different nature for questions, 
commands and exclamatory sentences. It is as if statements 

28 The case of Estonian words like kabi (hoof), kapi (near the cupboard), 
kappi (into the cupboard) is similar. The orthographic b represents a voiceless 
consonant, unaspirated p; the p of kapi represents a lengthening of this, and 
the pp of kappi a further lengthening. The words might be written phonetically 

kapi, kappi, kapppi. 

27 Professor L. Posti, in his article, On Quantity in Estonian (Jour. Soc. 
Finno-ougrienne , LIV, Helsinki, 1950), expressed the opinion that by a special 
classification of the lengths in Estonian the number of chronemes is reducible 
to two. Miss Krass maintains her opinion that there are three chronemes. 

28 Except possibly in a language where two of the chronemes are associated 
with notable distinctions of tamber, if such a language exists. 

18 Thesis on The Phonetics of Estonian, pp. 107, 110. 

80 Weakly stressed pretonic short a, as in ta'niin (Tannin), pa'siivai 
(Passiva), va'loinij (wallonisch), would be shorter still. 
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constituted a special variety of the language different from these 
other types of sentence. In the chroneme applicable to questions, 
commands and exclamations there are no perceptible variations of 
chrone, except that if sentences of these kinds are divisible into two 
or more sense-groups, the final sound of each non-final sense-group is 
generally rather longer than the other syllabic sounds. In complete 
statements, on the other hand, there is likewise a single chroneme 
but it comprises three very distinct chrones, a very long, a half- 
long and a short. The very long chrone is used in the penultimate 
syllable of every sentence, the half-long at the end of non-final 
sense-groups and the short in all other situations. 

Representations of Chronemes in Writing 

423. The theory of chronemes forms the basis for the marking 
of length in broad phonetic writing. No indications of duration 
are needed in writing non-chrone languages. In languages which 
make use of two chronemes the long chroneme has to be shown 
in writing; this may be done either by a mark or by doubling the 
letters denoting the sounds which have the long chroneme. In 
languages like French or Sinhalese in which use is made both of 
significant and of non-significant length, it is not necessary to mark 
the non-significant lengths, even though these lengths may be 
equivalent to those found in the long chroneme. It may, however, 
be found convenient to mark these lengths for the sake of consistency; 
this does not narrow the transcription. Languages which make 
use of three chronemes require marks for two of them; alternatively, 
and perhaps preferably, the chronemes may be represented by 
single, double and treble letters. This latter plan is followed by 
L. Krass for Estonian (§419). 

CHAPTER XXIV 

STRESS, PROMINENCE 

The Nature of Stress 

424. It is possible to pronounce a given syllable with various 
degrees of force (strong, medium, weak, etc.) and with various kinds 
of force (level, crescendo, diminuendo or crescendo-diminuendo). 

425. Force of utterance, abstracted from the other attributes 
of speech-sounds, is termed stress. Stresses are essentially 
subjective activities of the speaker. A strongly stressed syllable, 
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for instance, is one which he consciously utters with greater effort 
than other neighbouring syllables in the word or sentence. Special 
visible gestures, and especially movements of the head, arms and 
hands, often accompany the utterance of strongly stressed syllables. 
The application of strong stress involves in fact a special effort of 
the whole body; it is not confined to the speech mechanism. 

426. To the hearer degrees of stress are often perceived as degrees 
of loudness. If a sound or syllable is said several times with varying 
degrees of stress in similar phonetic contexts, it appears to be 
less easy for the hearer to compare the degrees of its loudness than 
it is for the speaker to compare the degrees of effort he uses. 

427. In actual language it is often difficult, and may be im¬ 
possible, for the hearer to judge where strong stresses are. One 
reason for this is that the loudness or carrying power of a sound 
is largely a matter of its inherent quality (tamber), 1 and that in 
consequence a sound pronounced with strong stress may be less 
loud than another sound pronounced with weaker stress. Another 
reason is that in practice strong stresses are usually accompanied 
by one or more other speech attributes which contribute towards 
making the syllable more clearly heard (see §§446, 450, 456, 457). 2 

1 Thus, if vowels of the u and a types are pronounced with what the speaker 
feels to be the same degree of effort, the a is found to be much louder than the 
U; it can be heard at a greater distance, and it comes out more clearly over a 
microphone. (See also §437, and footnote 12 to §204.) 

Loudness must be distinguished from “penetration.” A sound is said to 
be "penetrating” when it can be heard when other sounds are going on 
simultaneously. It “penetrates” those other sounds. Penetration is not an 
attribute of the sound itself; it is dependent upon the nature of the other 
sounds through which it is heard. Very soft sounds may penetrate loud 
sounds of a very different quality, that is to say, they may be clearly audible 
as separate sounds while the loud sounds are in progress. Thus, a weak s 
can be heard while an organ of considerable power is being played, although 
the same sound could not possibly be heard in the neighbourhood of an 
escape of steam from an engine. Conversely, the qualities of many organ 
flue stops, though loud and distinctive when played alone, do not “penetrate” 
the qualities of other organ stops. They blend with them and alter the 
quality of the total sound, but are not heard separately by ordinary people. 
It needs an expert to tell what stops are employed when a flue tone is being 
produced by the use of several stops. 

2 See my Outline of English Phonetics, footnote to §909, where it is pointed 
out that a strong stress may occur on a sound incapable of receiving any 
noticeable increase of loudness. 
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Stress Languages 

428. Languages in which meaning depends in any degree upon 
types of stress or upon the location of strong stresses in sequences 
of syllables are termed “stress languages.” They fall into three 
categories, (1) those in which the location of strong stresses in 
words of more than one syllable is an integral part of the pronuncia¬ 
tion of words, (2) those in which the use of special types of stress 
is an integral part of the pronunciation of words, and (3) those in 
which strong stresses are used in sentences but do not have fixed 
positions in particular words. 

429. Stress languages of the first category are numerous. 
Among them are English, German, Russian, Spanish, Provencal, 
Danish, Hungarian, Icelandic, Welsh, Greek, Swahili. In these 
languages a given word always, or generally, has strong stress on 
a particular syllable. In some of them words of more than one 
syllable may be differentiated by the position of the strongest 
stress. 

430. Languages of the second category are .Serbo-Croat and 
(probably) Somali. 

431. The best known language of the third category is 
French. In this language, all the syllables of groups containing no 
emphatic word have approximately equal stress except the last 
of the group which has a somewhat stronger stress than the others. 3 
Tswana is another such language. As was pointed out in §386, 
the usage there is that the two last syllables of every complete 
statement are strongly stressed, except when these syllables have 
the tonal pattern ['■'•], in which case the penultimate only is 
stressed. 

432. In stress languages emphasis, whether for intensity or 
for contrast, is generally shown by means of special strong stresses. 
In languages of the first and second categories emphasized words 
of more than one syllable generally receive a stronger stress than 
usual on the syllable normally stressed, and emphatic mono¬ 
syllabic words receive strong stress in comparison with other words 
near them in the sentence. In French, which is a stress language 

3 It is difficult, or perhaps impossible, to prove this statement. It is, 
however, common experience that good results are obtained in teaching 
French pronunciation if the teaching is based on this assumption. 
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of the third category, words requiring emphasis for intensity have 
a strong stress; if they are of two or more syllables, the stress falls 
on the first or second syllables according to well-known principles 
which we need not recapitulate here. 4 French words of more than 
one syllable requiring emphasis for contrast generally have a strong 
stress on the last syllable. 5 I have had no opportunity of making 
precise observations on emphatic words in Tswana, but it seems 
likely that the effect is produced by increasing the tonal intervals 
and not by any special stress. 

433. Many languages exist in which stress variations can 
hardly be said to exist at all. They may be called '‘stressless 
languages.” Such are Japanese, Hindustani, Marathi. 

Stress and Prominence 

434. In all languages, including those in which the speakers 
make no conscious use of stress, syllables occur here and there 
in connected speech which are heard to be more "prominent” than 
others—syllables which stand out from other syllables near them 
in the word or sentence. Prominence is an effect perceived object¬ 
ively by the hearer. It is thus quite a different thing from stress, 
which is a subjective activity on the part of the speaker. 

435. Prominence of a syllable may be due to strong stress, 
but it may also be due to other features of pronunciation and 
particularly to the inherent quality of sounds, to the length of 
syllables (which may be occasioned either by the length of sounds 
in them or by their number) or to intonation. Different ways of 
effecting prominence are exemplified in §§436-447. 

Prominence by Stress 

436. To appreciate the degree of prominence which can be 
given by stress alone one must pronounce a sequence of similar 
syllables on a monotone or in a whisper, putting the strong stress 
first on one syllable and then on another. A sequence such as 
ninininin serves the purpose, thus 'ninininin, ni'nininin, nini'ninin, 
ninini'nin, or pairs of English words like bi'gin [begin), 'bigin [Biggin), 

1 They are set out particularly clearly in Chapter XIX of Coustenoble 
and Armstrong's Studies in French Intonation. 

6 See Coustenoble and Armstrong, Studies in French Intonation, Chapters 
XXIII, XXV. 
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ri'mit (remit), 'limit (limit). Experimenting thus it will be found 
that the positions of the strong stresses are always clear to the 
speaker, but are not always unmistakable to the hearer if the 
speaker is careful to make the stressed and unstressed vowels as 
nearly as possible of the same length. The fact that an English 
hearer perceives, or thinks he perceives, the chief prominence to 
be on the second syllable in ri'mit and the first syllable in 'limit, 
is, I believe, largely due to the fact that he knows the words and 
knows how he would pronounce them himself. It does not 
necessarily follow that foreign hearers would regard the same 
syllables as being prominent. Indeed, in my experience, many 
foreigners fail to perceive English stress correctly. This applies 
more particularly to those whose mother tongue is a stressless 
language such as Japanese or Hindi. 

Prominence by Tamber 

437. Prominence effected by vowel quality may be illustrated 
by the following experiment, which was devised by A. Lloyd 
J/mes. The English word mi'kanikali (mechanically) has strong 
str. ss on the second syllable and weak stresses on the others: If 
one whispers the word (to eliminate intonation) and pronounces it 
in an artificial manner with strong stresses on the mi and ni and 
weak stress on ka, thus 'mika'nikali, it has been found that hearers 
still think they hear "stress” on the ka. (I have often tried this 
experiment.) Actually what the hearers perceive is the prominence 
of the syllable ka. The prominence of the English short a quality 
so outweighs that of the short i that it is well-nigh impossible to 
render an i more prominent by stress alone. 8 I believe that this 
prominence of the English short a as compared with the short i 
is inherent, and would therefore be observed by foreigners un¬ 
familiar with the word as well as by the English hearers with whom 
the experiment is tried. 

Prominence by Length 

438. Prominence effected by length may be observed in stressless 
languages like Sinhalese and Tamil. Let the reader say aloud the 
following Sinhalese words on a monotone with as nearly as possible 

* In voiced speech the syllables mi and ni could be made more prominent 
than ka by means of stress coupled with intonation, e.g. by applying the 
subjective intonation 
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equal stress on each syllable, but observing carefully the vowel 
lengths and the double consonants. It will be found that hearers 
will easily perceive the prominence of the long syllables. 


saigata (famine) 
ekkenek (one person) 
pip passer) (behind) 
barakaratte (cart drawn by 
two bulls) 

attoraloisua (watch) 

Similar examples from Tamil 7 8 
ailamaram (banyan tree) 
vi:|;u 10 (house) 
paili (lizard) 
papttukkol 12 (read) 


palaita (district) 
hediima (the making) 
iijapasse (afterwards) 
nida:ganna|Ba (I sleep) 

a:rassa:karana(3a (I protect) 7 

vipattu 9 (calamity) 
pafein 11 (I did not endure) 
irukkira:n (he is) 


439. The prominence which causes consonants to be syllabic 
is generally produced by length. That this is so is shown by a 
comparison of such English words as 'med|a [meddler) and ‘media 
[medlar), 'kodjig [coddling) and 'kodlig [codling), 1 bAtni q [buttoning) 
and ‘pAtni [Putney). The j’s and ij of 'med|a, 'kodjig, 'bAtnig 
are weakly stressed, but the sounds are much longer than the 
I’s and n of the other words. It is this length that makes them 
prominent enough to strike the ear as forming syllables. 


Prominence by Intonation 

440. Syllables are sometimes rendered prominent by special 
sequences of voice-pitches. Such a sequence is [-\ _], in which 
the second syllable is liable to sound more prominent than the 
others. The extent to which intonation alone is capable of rendering 


7 These examples are taken from Perera and Jones, Colloquial Sinhalese 
Reader. I have come to the conclusion that the view expressed by me (in 
1919) on page 14 of that book, that there is a tendency to put stress on long 
syllables of Sinhalese words, is incorrect. I believe I mistook prominence 
for stress, and am now very doubtful if there are any real stress distinctions 
in the Sinhalese language. 

8 Examples taken from J. R. Firth’s Short Outline of Tamil Pronunciation. 

’ Narrowly vibattUI. 

10 Narrowly vi:c|_UJ. 

11 Narrowly pattern. 

la Narrowly pacpttukkol. 
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a syllable prominent without the aid of stress or some other attribute 
is a subject which merits detailed investigation. Here I am only 
able to give some preliminary notes on it (§§441-445). 

441. Experiments appear to show that, within limits, no 

particular voice-pitch is inherently more prominent than another; 
for instance, it cannot be laid down as a general principle that 
high pitches are more prominent than lower ones. The reader may 
test this for himself by experiments such as the following. Pro¬ 
nounce a syllable, say la, on alternate moderately high and 
moderately low level pitches, thus keeping the lengths 

and stresses as equal as possible. A listener will find it quite 
easy to “hear” the higher pitch to be the more prominent. On 
the other hand he will find it equally easy to perceive the lower 
pitch as the more prominent. It is possible indeed that people 
of different nationalities may differ in their perception of the 
prominence of high and low pitches, attributing prominence to 
syllables which would have stress if a particular sequence of intona¬ 
tions were to occur in their native language. If the speaker wishes 
to give unmistakable effects of prominence in the above intonation 
sequence, he must either stress certain syllables, e.g. 

or or etc., or lengthen them, thus 

or or etc. 13 

442. So again, if a sequence of three syllables, say an invented 

word barana, is said with equal lengths of the vowels and equal 
stresses and with such an intonation as [•’.], it does not seem 
that any one of the syllables is more prominent than any other; 
in fact, it is not difficult to “perceive” any one of the syllables as 
the most prominent. Nevertheless a French hearer might well 
think he perceived prominence on the final syllable of this sequence, 
while an English or German hearer would be more likely to attribute 
prominence to the second syllable. The same kind of thing is found 
if barana is pronounced with other sequences of level intonations 
such as any syllable may be perceived as 

prominent, and hearers of one nationality may draw conclusions 
different from those of another nationality. 


13 In this graphic system large dots represent syllables pronounced with 
stronger stress than those represented by small dots. The horizontal lines 
represent long syllables pronounced with the same degree of stress as those 
indicated by small dots. 
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443. It seems, however, that rhythmic repetitions of a particular 
pitch give to most hearers the impression of prominence of that 
pitch under certain conditions. When, for instance, a high pitch 
is repeated at equal intervals of time and there are several inter¬ 
vening mid and low pitches, the high pitch may appear prominent, 
and it may require a certain effort of imagination to “perceive” 
prominence on one of the low syllables unless it is made prominent 
by length or stress. The following sequence of tones furnishes an 
example: [••••'•••.•• etc.]. The same appears to apply to 
low-pitched syllables occurring at equal intervals of time among 
high syllables, e.g. etc.]. It needs a still greater 

effort to perceive prominence on mid-pitched syllables surrounded 
by rhythmically placed high and low syllables. 14 


11 Prominence of mid-pitched notes situated between higher and lower 
pitched notes is often required in music. Thus, if the passage 





etc 


is played on an organ legato and without any phrasing, it appears to be easier 
to perceive it 




than as 




etc. 


(The passage is taken from Bach's Partita in E minor for violin, where the 
actual rhythm is the last of these three.) 

Similarly it needs much concentration to keep in mind the rhythm indicated 
in the writing of such a passage as the following from Kreisler’s Praeludium 
and Allegro for violin 


nfM 



P -M-M- ' 

£ 






== 


yd 


i ~ 111 

s 


etc. 


on account of the prominence of the high pitches occurring on every third 
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444. Falling and rising tones appear to have a certain inherent 
prominence. This can be tested by saying a sequence of long 
and equally stressed syllables, say ba:ra:na:, with intonations such 
as [\__], [-\_], [-\], [_\-], [/-“], ["/■], ["/]. 
["/_]. To me, and no doubt to others, the falling and rising 
tones in these sequences have a certain prominence as compared 
with the level tones. Concrete examples of such tone sequences 
are to be found in such a language as Burmese, where successions 
of long and equally stressed syllables are common. Thus in the 
Burmese expression _pja: s pp: Je: _3i: (he thus replied) the 
syllable s pjo: appears more prominent than the others. 

445. Summarizing what has been said in the preceding para¬ 
graphs we may say that prominence by intonation alone is rare in 
speech, and is perhaps confined to falling and rising tones on single 
syllables and to cases where level tones occur in some sort of 
rhythmic pattern. If a syllable has to be made prominent in other 
circumstances, the intonation used must probably be accompanied 
by some different reinforcing attribute. 

Prominence by Combinations of Attributes 

446. As syllables can have prominence through any one of 
the means mentioned in §§436-444, so they can be and often are 
rendered prominent by combinations of them. A common com¬ 
bination is, for instance, strong stress together with special intona¬ 
tion; this is the usual means of giving prominence in stress 
languages. 15 Sometimes the combinations serving to effect 


note, which are liable to give the hearer the impression of a passage in triplets. 
The passage is one where, if it is played exactly as written, the prominences 
indicated in the printed music can hardly be otherwise than subjective. I 
believe it to be impossible for a listener unacquainted with the passage to 
perceive them. The prominences which are presumably intended can in 
fact only be apprehended by those who know beforehand where they are 
supposed to be. 

Performers often give indication of the correct prominences in such passages 
by means of rubato or slight prolongation of some of the notes intended to 
be prominent. I have noticed, however, more than one eminent violinist 
who does not give any such indication in the above passage. In fact this 
passage is so rapid that it would hardly be possible to do so. 

15 See various works on English intonation, e.g. Kingdon’s Groundwork of 
English Intonation, Armstrong and Ward’s Handbook of English Intonation, 
or Chapter XXXI of my Outline of English Phonetics. 
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prominence are very subtle, and it is difficult to determine what 
attribute, if any, is the dominating one. Languages in which 
significant distinctions are made by such means are Somali (see 
§§580-582), Serbo-Croat (§§466, 558-560, 583) and, perhaps, 
Hindustani. 

447. A syllable which is prominent by reason of one attribute 
may be less prominent than one would expect on account of some 
other kind of prominence in a neighbouring syllable. Examples 
may be observed in many words of the Estonian language, such as 
'puhatta (puhata, to rest), 'kahekksa (kaheksa, eight, m. sg.), 
'vapastap (vabastab, he sets free), 'vapantamisessse ( vabandamisesse , 
apology, inessive sg.). The first syllable of these words, as of 
nearly all Estonian words, has strong stress, and Estonian speakers 
feel these syllables to be the most prominent in the words. Some 
of the subsequent syllables have, however, a certain prominence 
by reason of the number of sounds in them and their consequent 
length. To me as an outsider unfamiliar with the language, but 
listening to the words objectively, the prominence of these other 
syllables often seems greater than that of the stressed syllables 
which are really the most prominent to the subjective feeling of an 
Estonian speaker. 

Need for Distinction Between Prominence and Stress 

448. Syllables rendered prominent otherwise than by stress are 
liable to be mistaken for strongly stressed syllables by foreign 
hearers unfamiliar with the words. It is, for instance, very easy 
for an Englishman to imagine that Sinhalese speakers put strong 
stresses on the long syllables of the words quoted in §438. (Readers 
are recommended to try experiments with them.) Likewise some 
sequences of voice-pitches are liable to give to English observers 
the effect of strong stress on a syllable, when they are such as would 
be associated with strong stresses in English. English hearers 
would be liable, for instance, to form the opinion that the syllable 
s pja: in the Burmese example in §444 is stressed, which I do not 
think it is. 

449. The following is an example where a combination of tones 
with lengths gives to the English hearer the impression of stresses. 
There are prominences, but my observation is that the first two 
prominences have no stronger stress than the less prominent 
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syllables adjoining them. The example is from the Tswana 
language, a language which I find to be stressless except for 
the two final syllables of statements (for which see §§386, 431). 
The last clause of the Tswana version of the Lord’s Prayer is 
kaa bo slna bokhutb. With precise tone and length marking 
this would be written kaa bo sina bokhuitb. The length of 
the vowels of kaa 16 and -khu:- and the rhythmical placing of the 
high-toned -si- give a certain prominence to these three syllables, 
and the effect to English hearers is that they are stressed. The 
fact that kaa and si are not stressed can be demonstrated by the 
experiment of saying this phrase with strong stresses on the first 
bo and on na, while keeping the correct tones and lengths. However 
much stress one puts on the bo and na, English listeners will continue 
to “hear stresses” on kaa and si, unless they see the speaker’s 
gestures. 17 

Difficulty of Determining Stress 

450. From what has been said it will be seen that it is often 
difficult, if not impossible, to compare objectively by ear the 
stress of one syllable with that of another. In the first place, as 
has been pointed out in §437, a sound pronounced with strong 
stress is often less loud than another sound pronounced with weaker 
stress. Secondly, as in practice strong stress is almost always 
accompanied by a special intonation or some other attribute, one 
often cannot say for certain how much of the prominence is to be 
attributed to the stress. I therefore do not anticipate that it will 
prove possible to determine degrees of stress experimentally from 
sound records. It is conceivable that they might be determined 
experimentally from the movements of a speaker (muscular con¬ 
tractions of the chest wall, etc.) but little has hitherto been achieved 
in this direction as far as I am aware. 

451. It will be seen further that to ensure correct phonetic 
analysis of foreign languages it is essential that the observer should 
differentiate correctly between strong stress and prominence 
effected by means other than stress. It is a natural tendency of 
English people and speakers of other stress languages to attribute 
all prominences to stress alone. As most observers have hitherto 

le This doubled vowel is indistinguishable acoustically from a long vowel. 

15 These gestures indicate the ordinary strong stresses on the two final 
syllables, -khyi- and -tb, as well as the special stresses on bo and na. 
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not been on the look-out for the distinction between stress and 
prominence, it is necessary to be careful in accepting statements 
as to stress in non-European languages, and we must be ready to 
review them in the light of what is now known of prominence. 

Stress as a Significant Attribute in Words 

452. One function of stress is to give emphasis to words, as 
already mentioned in §341 ff. This is a use of stress in the 
sentence, strong stress being employed to make a word or part of 
a word stand out in order to give special intensity to its meaning 
or to show that it is in contrast with something. It is probable 
that this use of stress is to be found to a certain extent in stressless 
languages as well as in stress languages. The employment of stress 
for purposes of emphasis does not come within the purview of this 
book. We are here only concerned with the other functions of 
stress, namely, as an essential feature of the pronunciation of 
words and as a means of distinguishing one word from another, 
as is done in stress languages of the first and second categories 
mentioned in §§428-430. 

453. English, as is well known, is a language in which stress 
is used in this way. When I say the word 'limit (limit) I con¬ 
sciously put strong stress on the first syllable, and when I say 
ri'mit (remit) I consciously put it on the second. Various pairs 
of English words are distinguished by the position of the strong 
stress. Such are 'inkriis (increase, n.), in'krirs (increase, v.), 'import 
(import, n.), im'port (import, v.), 'insAlt (insult, n.), in'sAlt (insult, v.), 
'torment 18 (torment, n.), tor'ment (torment, v.), 'rirfAnd (refund, n.), 
'rir'fAnd 19 (refund, v.), 'biiou (billow), bi'lou 20 (below) 21 

454. Examples from other languages of distinctions of words 
by stress alone are: German 'virder|ho:len (fetch back), [Virder'horlen 
(repeat) (both words written wiederholen), 'yrber t blike (plural of 
Uberblick), iyrber‘blike (1st person present of iiberblicken); Spanish 
'aXa (halla, he finds), a'Xa (alia, there), 'termino (end), ter'mino 

18 Pronounced by some 'tormant. With them the pair is not a case in 
point. 

19 Pronounced by some ri'fXnd. With them the pair is not a case in point. 

20 Pronounced by some ba'lou. With them the pair is not a case in point. 

21 Pairs of words, such as those written present . subject, recount, are not 
cases in point since they show differences of sound quality as well as stress: 

'preznt, pri'zent, ‘sAbd 3 ikt, sab^ekt, 'rir'kaunt, ri'kaunt. 
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(I finish), termi'no (he finished); Provencal 22 'laggi (sorrow), lag'gi 
(to languish); Greek 1 poli (city), po'li (much), 'ennja (worry), 
en'nja (nine). 

455. It will be observed that in all such languages the words 
distinguished by the position of strong stress alone are comparatively 
few in number, and that a substantial proportion of those that 
exist have such meanings that they would not be liable to be 
confused in any ordinary context. This peculiarity is, I believe, 
attributable to the fact that stress alone, without the accompaniment 
of some other distinguishing feature does not constitute a very 
effective means of differentiating words. The effort of pronouncing 
syllables with strong stress is clearly felt by the speaker, but the 
resulting prominence is not always easily perceived by hearers. 

456. Such experiments as have been carried out support this 
view. Thus N. C. Scott’s experiments described in his article 
An Experiment on Stress Perception in Le Maitre Phonetique, July, 
1939, p. 44, tend to show that when intonation is eliminated, stress 
distinctions are difficult for the hearer to recognize. He concludes: 
“there seems to be a strong indication that stress, unaided,, is not 
very efficient as a distinguishing feature in English.” (See also §436.) 

457. In practice strong stresses are nearly always accompanied 
by intonations which help to render the syllables prominent. If, 
for instance, a Southern English speaker is asked to say the words 
'impoit and ira'poit in a normal manner with a falling intonation, 
he will in all probability distinguish them by intonations of the 
types [ \ . ] and [ • \], If asked to say them in a questioning 
manner, he will probably distinguish them by intonations of the 
types [ / ] and [ • / ] And when such words occur in connected 
speech, they likewise nearly always have distinctive intonations. 
(These are sentence intonations, not word-tones. See §§571, 572.) 

458. The combination of stress with appropriate intonation is 
a particularly effective means of rendering syllables prominent, 
and often gives prominence outweighing that of neighbouring 
syllables which have considerable length or other attributes making 
for prominence. In the word 1 pedigrii, for instance, said with a 
falling intonation [ \ . _ ], the strong stress coupled with the high- 

22 Dialect of Arles, as described in H. Coustenoble’s Phonetique du 
Provenfal Moderne, whore other examples will be found (p. 137). 
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falling intonation renders the first syllable much more prominent 
than the last syllable which has a vowel several times as long as 
the e of the first syllable. (Nevertheless cases are sometimes 
found where the combination of strong stress and prominent 
intonation does not give unmistakably the effect of greatest pro¬ 
minence to the syllable affected. See §447.) 

459. The linking of stress with special intonations in English 

is so frequent and so necessary that the opinion has been expressed 
by some that stress is immaterial, or at least negligible, and that 
all necessary prominence is given to syllables by intonation only. 
The language was recorded on this assumption in H. E. Palmer’s 
book English Intonation. My practical experience in teaching 
English to foreign learners has shown me that there is much to 
be said in favour of this view. I find that a foreign learner can 
almost always acquire a pronunciation recognizable as normal 
English by learning the intonation only and ignoring the stress. 
I find too that the reverse is not the case: a really good accent 
cannot be acquired by a foreign learner who pays attention to 
stress but ignores intonation. But I do not find it possible to 
ignore stress altogether. There are rare cases where an English 
intonation without the needful stress would give an un-English 
effect. An example is where a word stressed on the first syllable 
is given the common intonation [ .\ ] denoting surprise or 
incredulity, e.g. 'riali [ .\ ] (Really?), 'jelou [ ] (Yellow?), 

etc., or one form of emphasis, e.g. 'veri [.\] (Very). Here 
the stress on the first syllable is so strong as to overpower the 
prominence produced by the high-falling intonation on the final 
syllable. Other instances are to be found where strong stress 
occurs in the middle of a continuous intonation, rising, falling 
or level; it is the case dealt with by N. C. Scott in the article 
referred to in §456. Insistence upon stress is also necessary in 
teaching the rhythm of English to foreign learners. (See further, 
§§570, 571.) 

460. It would seem that in other stress languages, such as Italian, 
Spanish, Russian and Greek, differences of stress are likewise almost 
always accompanied by special intonations. In Russian differences 
of stress appear to be always accompanied by differences of sound- 
quality as well as by special intonations (see §536). 

461. Owing no doubt to the elusive character of stress from 
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the point of view of the hearer, 23 most stress languages employ 
only two degrees of stress ("strong” and "weak") for effecting 
word-distinctions. I have only heard of one language, Serbo- 
Croat, which employs three degrees (“strong,” "medium” and 
“weak”) for this purpose. In this language words may be distin¬ 
guished by the position of the strong or medium stresses relative 
to the weak ones; thus ( veseli (gay, nom. pi.) and ve,seli (he 
entertains) are different words. Words may also be distinguished 
according as a syllable has strong or medium stress: compare 
'gora (worse), igora (wooded hill), 'pretvori (he changes), ipretvori 
(he changed). Even monosyllables in this language have appropriate 
degrees of force when emphasized. Thus when some words, e.g. 
'sad (now), 'tsar (emperor) have to be emphasized, strong force is 
used, but when other words, e.g. ( rad (voluntarily), t zar (then), 
ipas (dog), have to be emphasized, they are said with only medium 
force. 24 Presumably there exist some monosyllabic words in the 
language which are distinguished from each other solely by the 
degree of force used when they are emphasized. 

462. In ordinary stress languages, i.e. those in which the location 
of strong stresses is an essential feature, it is generally considered 
possible to distinguish and locate “secondary stresses,” i.e. degrees 
intermediate between the strongest and weakest stresses. I think, 
however, that the so-called intermediate degrees of “stress” are 
as a rule degrees of prominence due to tamber, sound-groupings, 
length or voice-pitch, or combinations of these, with or without 
the accompaniment of stress. However that may be, it would 
seem that in languages like English or German words are never 
distinguished by the positions of secondary stresses alone, 25 and 

23 Stress being, as already observed in 5425, mainly subjective and only 
partially perceived as loudness. (See my Outline of English Phonetics, §909.) 

21 The above information concerning Serbo-Croat was given me by 
Professor D. B. Fry. 

26 I looked out for cases of this while preparing my English Pronouncing 
Dictionary, but I discovered only one instance, and that a very doubtful 
one, namely, the two meanings of certification. It is barely possible that in 
the sense of "granting a certificate” the word might be pronounced 
saiitifi'keijn by some, thus distinguishing it from t salt if i' ke i Jn meaning 
"act of certifying”; I have, however, only heard (Saitifi'keijn for both 
senses. It is, moreover, probable that speakers, if any, who put a secondary 
stress on the second syllable would pronounce S3|tifi'keijn with short 0, 
in which case the word would cease to be a case in point. 
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that therefore we need not concern ourselves with them here. 
The semantic function of more than two degrees of stress in English 
appears to be confined to sentences, or to compound words of a 
type that cannot in my view be taken into consideration in the 
investigation of phonemic distinctions, 26 any more than the functions 
of English intonation can be. The position of secondary stress 
has a certain minor importance in that in a relatively small number 
of words a displacement of it may give an unnatural though not an 
unintelligible pronunciation. 27 

Kinds of Stress 

463. There exist, as mentioned in §424, different kinds of stress 
as well as different degrees of stress. They are 

(1) level stress within the syllable, i.e. where the same degree 
of force is sustained throughout the greater part of the 
syllable, 

(2) crescendo stress, i.e. where the force increases during the 
course of the syllable, 

(3) diminuendo stress, i.e. where the force diminishes during the 
course of the syllable, 

(4) crescendo-diminuendo stress, i.e. where the force increases 
and then diminishes within the syllable. 28 

Each of these kinds may be uttered with different degrees of force, 
but the difference between the kinds of stress can only be made 
clear when the degree of force is considerable, i.e. “strong” or at 
the least “medium.” If the degree of force is “weak,” differences 
in the kind of stress cannot as far as I know be made perceptible 
to hearers. 

464. The above terms, though fairly intelligible, contain 
elements of vagueness. We know how many syllables there are 

26 Such as movie-auditorium, which was quoted by Trager and Bloch in 
their article The Syllabic Phonemes of English, or elevator-operator quoted in 
Trager and Smith’s Outline of English Structure, p. 39, as words containing 
four essential degrees of stress. 

27 Even the common foreign pronunciation |eksami'neijn for the English 
igizami'neijn ( examination ), which contains an error of sound as well as 
of stress, is not unintelligible, though it is not our way of saying the word. (See 
Outline of English Phonetics, §942.) 

28 There cannot be a diminuendo-crescendo stress within a syllable. Such 
a type of stress necessarily produces a sequence of two syllables. 
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in a word by counting its peaks of prominence, but we cannot 
define with precision the points at which the syllables begin and 
end. Still less can we define the limits of the portions of syllables 
which are to be considered as bearing the strong stress. 29 And if 
we often cannot tell what the whole of a syllable is, it evidently 
cannot be stated as a general principle that stress affects entire 
syllables. Moreover syllables often contain consonantal sounds 
which are by nature weak and can hardly be said to be capable 
of receiving degrees of stress. 30 However, in spite of this lack of 
precision, the general meaning of the terms used in the last paragraph 
is sufficiently clear to justify using them in practical phonetic work. 

465. The most usual kind of stress would appear to be the 
diminuendo type. English strong stress is generally considered to 
be of this type, though it might be maintained that crescendo- 
diminuendo stress is used in syllables such as jard {yard), wel {well) 
which begin with a weak consonant; it depends upon where one 
considers the point of incidence to be. The same applies to German, 
Russian and other stress languages. 

466. It seems that level stress, crescendo stress and crescendo- 
diminuendo stress all occur in Serbo-Croat and its dialects. Professor 
Fry has informed me that in normal Serbo-Croat the difference 
between crescendo and crescendo-diminuendo stress is used for word- 
differentiations in conjunction with special tones (the rising and 
rise-fall respectively) and with length (moderately long and very 
long respectively). The words for “young” and “bride" are 
examples of this given by Fry and KosTid in their Serbo-Croat 
Phonetic Reader (p. 8). They write them phonetically with the 
notation mla’da and ,mla:da respectively; this denotes that the 
first syllable of the first word has a rising tone, a moderately long 
vowel and crescendo stress, and that the first syllable of the second 


29 I.e. where the “incidence” of the stress is, and how long the stress lasts. 
Tapping experiments with a kymograph appear to show that if a strongly 
stressed syllable consists of a strong consonant followed by a vowel, people 
more usually feel the stress to begin at some point in the consonant at any 
rate in English. But theTe are exceptions. 

30 Sometimes an increase of force with which a consonant is pronounced 
may convert it into another consonant E.g. it is force only that distinguishes 
voiceless b (b) from unaspirated p. Semi-vowels, such as j and w, would 
appear to be essentially unstressable. 
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word has a rise-fall tone, a very long vowel and crescendo- 
diminuendo stress. 31 G. L. Trager has also noted the existence 
of the two kinds of stress in Serbo-Croat; he calls them "rising” 
(= crescendo) and “falling” (= diminuendo). 32 He also remarks 
that the crescendo stress may have level stress as a variant. 33 He 
has found similar stresses in a Serbo-Croatian dialect. 34 

467. It is doubtless the case that in such a language as Serbo- 
Croat the amounts of decrease and increase of force in the diminuendo 
and crescendo stresses vary to some extent according to the phonetic 
context, e.g. according to the types of stress used in adjoining 
syllables. If this is so, we shall on investigation find a state of 
affairs in stress comparable to the variations of quality in the 
phoneme, length in the chroneme and voice-pitch in the toneme. 
Little information as to this appears to be available as yet, but it is 
reasonable to suppose, for instance, that the amount of decrease 
of force in a diminuendo stress would be different (probably less) 
when the following syllable is strongly stressed from what it is 
when the following syllable is weakly stressed. If a diminuendo 
stress has thus two or more values according to circumstances, 
those values can be grouped together and count as being the same 
for practical linguistic purposes. It is likely too that a level stress 
would suffice to indicate crescendo stress before a following weakly 
stressed syllable. In this case a level stress and a crescendo stress 
would have to be grouped together as a single linguistic entity, 
in the same sort of way as tambers are grouped together into 
phonemes. 

Stronemes 

468. The foregoing account of the nature of stress and its 
occurrence in language has been necessary in order to make clear 
how far stresses can be grouped together in a manner comparable 
to the grouping of sound-qualities into phonemes. It will be seen 

31 Tone-marks (' and “) might also be used to mark this distinction: 
thus mlada, mlada. 

32 Presumably the same as what Fry has analysed as "crescendo- 
diminuendo.” 

33 G. L. Trager, Serbo-Croatian Accents and Quantities in Language, Vol. 
XVI, No. 1, January, 1940. 

34 G. L. Trager, Serbo-Croatian Dialect in Le Maitre Phonitique, January, 
1940, p. 40. 
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that such grouping is possible in one case only, namely, where 
kinds of stress are concerned. If a particular kind of stress were 
termed a "strone,” we should no doubt find that in such a language 
as Serbo-Croat two or more kinds of stress would be groupable into 
a single "stroneme,” as suggested in the preceding paragraph. 
Owing, however, to the extreme difficulty of observing stress and 
of abstracting it from the other factors making for prominence 
which almost always accompany it, it is probable that the idea of 
the stroneme may be dismissed as of little or no value for any 
practical purpose. 

469. Apart from Serbo-Croat and other languages, if any, 
which exhibit similar peculiarities, it is the location of strong 
stresses and not their type which has significance in stress languages. 
In this respect stress differs from all other features of speech. In 
a stress language every word of more than one syllable must have 
a strong stress somewhere, 35 and one word may be distinguished 
from another by the position of the strong stress. Being however 
a matter of location and not of kind, ordinary stresses cannot be 
grouped in any way into families corresponding to phonemes, 
chronemes and tonemes. 


CHAPTER XXV 

TONEMES 

470. Languages in which voice-pitches are used for the purpose 
of distinguishing words are called “tone languages.” Such 
languages are very numerous. They include Chinese (all types), 
Siamese, Burmese, etc., Bantu languages of Africa (except Swahili), 
numerous languages of West Africa and the Sudan, Norwegian, 
Swedish, Lithuanian, Serbo-Croat, Panjabi and some American 
Indian languages. 

* 6 Reference is here made to “word-stress” only. In the sentence, where 
stress is used for other purposes than word distinctions, words of more than 
one syllable may be entirely without strong stress. Thus, the English word 
upon, said in isolation, has a strong stress on the second syllable (a'pon), 
but in connected speech it is often pronounced with a weak form apan in 
which neither syllable has a strong stress: e.g. 'djASt span 'ooui>] (just 
upon going), 'miljanz apan 'miljanz (millions upon millions). 
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The Nature of Tonemes 

471. In tone languages it is commonly found that the values of 
the tones of particular syllables vary according to the nature of 
neighbouring tones m the word or sentence or sometimes according 
to other factors. Often too there is variation according to the 
situation of words in the sentence. Tones may also be affected by 
sentence intonation. We thus find in tones a treatment resembling 
that obtaining in regard to phones (tambers): the tones of tone 
languages can be grouped together into "tonemes” in the same sort 
of way as phones are grouped into phonemes, and it is tonemes 
and not actual tones that distinguish one word from another. 

472. There is, however, a great deal of “overlapping” in tones. 
For example, the same actual mid pitch may stand for a high toneme 
in one tonal context and for a low toneme in another. Consequently 
the definition of a toneme cannot be exactly parallel to that of a 
phoneme. The following is suggested as a definition of a toneme: 
a family of tones in a given tone language which count for linguistic 
purposes as if they were one and the same, the differences being 
due to tonal or other context. It would seem impossible to restrict 
the conception of the toneme to the tones found in isolated words, 
since in so many of the tone languages the tonal variants are found 
only when words occur in conjunction with other words. 

473. The various tones comprised in a toneme may be called its 
“members” or “allotones.” Every toneme comprises one tone 
which can be considered as the “norm” or "principal” or most 
important member. It is generally the one which would be used 
when a word containing it is said in isolation, with the proviso, if 
necessary, that adjoining syllables in the word are said with 
specified tonemes. 

Examples of the Groupings of Tones into Tonemes 

474. In §§475-478 are given examples of the grouping of tones 
into tonemes in languages where tone is not linked with any other 
attribute. 

475. One of the tonemes in the Cantonese language may be 
termed "high-falling,” 1 since its principal member is a high-falling 
pitch. The words N to (much), N sin (previously), for instance, 
have this toneme and are pronounced with high-falling pitch 


1 The Cantonese themselves call it "level” (^phit)). 



154 


THE PHONEME: ITS NATURE AND USE 


when said in isolation or when they occur at the end of a sentence. 
In most non-final positions, however, such words are said with a 
simple high pitch in which no fall is perceptible. Thus s to N tP 
(very much), N sin N sag (Mr. 2 ) are said with pitches which would 
be represented graphically thus [' • \]. A simple high tone 
without perceptible fall is also used whenever the syllable ends 
with a stop consonant, i.e. unexploded p, t or k, as in ''gap (say), 
N jat (one), s tit (little), v tak (possible); the tone of these words 
would be graphically [~]. 3 

476. The method of allocating tones to tonemes may be illustrated 
by the tonal system of the Tswana language of South Africa, 4 a 
polysyllabic language in which every syllable requires an appropriate 
tone. The following are the chief tones found on single syllables 
in this language: high-level, mid-level, low-level, 5 high-falling to mid, 
high-falling to low, mid-falling to low. (See examples in §384.) Of 
these the three falling tones only occur on syllables which are pen¬ 
ultimate in the sentence and are long (with rare exceptions, see 
§479), and the low-level 5 only occurs in final position in the sentence. 
The fact that these four tones occur in such very special situations 
at once suggests that one justifiable method of tonemic classification 
would be to allocate these tones to the same tonemes as the much 
commoner high-level and mid-level tones. 


2 Lit. "previously born.” 

3 There are many varieties of Cantonese. The observations recorded here 
are taken from an analysis made by me of the pronunciation of Mr. Wu 
Kwing-Tong ( s WU' / kwig / ’thDQ) of Hongkong (see Jones and Woo, Cantonese 
Phonetic Reader). 

In Mr. Wu’s pronunciation there exists an "extra high level tone” which 
he uses in final position in certain cases. When the Reader was written, I 
regarded this tone as a variant of the high-falling tone employed solely in 
final positions as a kind of sentence intonation. I have, however, subsequently 
learnt that it undoubtedly constitutes a separate toneme, though I have 
no examples from Mr. Wu’s speech to prove this. Many conclusive examples 
have been furnished me by Mrs. Rose Fletcher, who is bi-lingual (English 
and Cantonese) and an expert phonetician, and who has examined the speech 
of many Cantonese. Illustrative words are s kwai (home) but ~kwai (tortoise), 
N sa:n (three) but - sam (clothes). It appears that in her type of Cantonese 
the distinction is even found in words ending with the stops. 

1 Rolong dialect. 

3 This tone may have a slight fall. 



TONEMES 


155 


477. This system works out as follows. All the tones in ordinary 
sentences are either high-level or mid-level with the exception of 
the two final ones. 6 In the two final syllables seven combinations 
are possible, viz. (1) high-fall low [ \. ], (2) high-fall mid [ s • ], 
(3) low-fall low [ N J, (4) low-fall mid [ v • ], (5) high-fall low-fall 
[ \vj. (6) low-fall low-fall [ ss ], (7) high-level high-fall [' \]. 
The last three of these are of very uncommon occurrence. Leaving 
these three out of consideration for the moment, we see that the 
high-fall may be regarded as a member of the same toneme as 
the high-level, the low-fall as a member of the same toneme as the 
mid-level, the low-level also as a member of the same toneme as 
the mid-level, while a terminal mid tone can probably be assigned 
to the same toneme as the high-level. In other words 

penultimate [ \ ] and [ s ] "stand for” [ ’ ] 

terminal [ • ] “stands for” [ ’ ] 

penultimate [ s ] and terminal [ ] "stand for” [ • ]. 

478. Thus the following words said in isolation might be written 
phonetically as follows with narrow tone-marking 7 : pu:la (rain), 
phiiko (ram), nfma (meat), kx^mo (cow). With broad tone¬ 
marking they might be written phonetically thus: .pula, ph5k5, 
nama, kxomo. 

479. The three rare terminal tone-sequences [ \s]. [ns] and 
[' \ ] show that there probably also exist two independent falling 
tonemes, a high-falling and a low-falling, the tones of which are 
not assignable to any level toneme. They completely overlap the 
other falling tones, but in view of the particular conditions under 
which they are used it seems necessary to regard them as indepen¬ 
dent tonemes. They are used almost exclusively on monosyllables, 
such as <|>a (here), mo (here), h (this 8 ). These would have to be 
marked with N and s both in narrow and in broad writing. Examples 
are, with narrow tone-marking, ki a ja: <j>$ (I am eating here), 
kl: €6 (here he is 9 ), or, with broad tone-marking, ki a ja fa, ki ed. 


8 Subject to the modification described in §480. 

7 The tone-marks used are as follows: N high-fall, s low-fall, _low-level, 
high-level; absence of mark denotes mid-level. 

8 Referring e.g. to poleb (story). 

* Lit. "it is this one" (referring to rjkwE, tiger). 
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The only cases I have come across of the use of the falling tone in 
the second syllable of a disyllabic word are the words ka:na (so big, 
so many) and ja$:n^ (thus); they would be written with broad tone¬ 
marking kan$, jaana. But there are presumably others. 

480. In addition to the tonal system above described there 
exists in Tswana a peculiar tonal phenomenon, which, though it 
has semantic function can hardly be classed as a toneme. It is 
that at various points in sentences a high tone has to be made 
slightly lower (roughly a semi-tone lower) than the last preceding 
high tone, and that all subsequent high tones in the sentence have 
to be on this lower level—unless and until another point comes 
where a similar lowering has to take place. Any mid tones 10 
following such a point are lowered to match. The points of lowering 
sometimes occur between words and sometimes in the middle of 
a word. Long sentences often contain quite a number of these 
lowerings. The points at which the lowerings take place appear 
to be prescribed solely by syntactical considerations. Thus the 
verb with a prefixed pronoun is always lowered in comparison with 
the subject preceding it, and the direct object is often 11 lowered as 
compared with the verb preceding it. The following are examples. 
(The points after which lowering takes place are shown by | ; the 
words are written with broad tone-marking.) 

tau | I b5na | ph5b<|>5b 12 (the lion sees the animal, lit. 

lion he sees animal; present 
tense of b5na) 

tau | i batla | ph5b<j>5b 12 (the lion is searching for the 

animal; present tense of 

batla) 

0 ka batla ph5b<j>5b 12 (I can search for the animal) 

o tjwala kxor5 ka sikapeb 13 (you shut the door with the 

key, present tense of tjwala) 

10 I.e. tones belonging to the "low” Tswana toneme. These are not very 
low except in final position. They axe usually from a minor third to a major 
third lower than the high tones; consequently, in ordinary sentences, there 
is ample scope for the further lowerings described in this paragraph. 

11 Not always. Whether lowering of the direct object takes place depends 
upon the tense of the verb. 

1J Narrow tone-marking of last word (if final) would be ph5b<j)5:b. 
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o tjwala kxa|r5 ka sikapeb 18 

but o tjwa|la ka sikopeb 13 

monna wa batho 14 

but mon|na | 5 molima 15 

sisana sixolo 16 
but sisana si|xolo 17 


(he shuts the door with the 
key; present tense of tjwala. 
This also means "while you 
shut the door with the key,” 
a tense of the participial 
mood of tjwala in the tonal 
form used when direct 
object follows) 

(while you shut with the key, 
a tense of the participial 
mood of tjwala in the tonal 
form used when the follow¬ 
ing word is not the direct 
object) 

(a man of the people, i.e. a 
poor man) 

(the man is good, lit. man he 
good) 

(a big stump (of tree)) 

(the stump is big) 


481. Lowerings of the type described above are not used to 
distinguish one isolated word from another, but they may distinguish 
words in connected sentences. Thus the 2nd person singular of 
a tense of the participial mood of tjhola (receive) is o tjhola when 
a direct object follows, but the same part of the verb tjhola 
(dish up) is a tJho|la when a direct object follows. 

482. The only way in which such a tonal usage could be brought 
into a tonemic classification would be to distinguish two kinds of 


11 Narrow tone-marking of last word (if final) would be SlkopElb. 

14 Narrow tone-marking would be manna wa baitho (if final). If another 
word were to follow, the tonal form would be monna wa ba|tho, or in 
some cases monna wa batho (depending upon the grammatical nature of 
the following word). 

15 Narrow tone-marking of last word (if final) would be mol^imD. 

14 Narrow tone-marking would be sisana SlXQflo (if final). 

17 Narrow tone-marking would be sisana Sl|xo:lo (if final). 
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high tone. These would be identical in actual tonal value, but 
one would involve a lowering of a subsequent high tone, whereas 
the other would have no effect on succeeding tones. Representing 
these two kinds of high tone by the signs ^ and ' respectively, 
the examples in the last paragraph could be indicated by the 
notation 5 tjhola and o tjhola. 18 

483. The existence of such a tonal system as that above described 
shows that it may be inconvenient and perhaps not always possible 
to group all the essential tonal phenomena of a tone language into 
tonemes. This does not, however, mean that the grouping of 
tones into tonemes is not a valuable procedure whenever it may 
be feasible. 

484. It is possible that there exist languages which have signi¬ 
ficant tones but no tonemes. In the course of a limited amount of 
work I was once able to do on the Shan language (which belongs to 
the Thai group) it seemed to me that the five tones of that language 
always preserve approximately the same relative values [ \ . s 

I did not discover any cases of a variant tone being used according to 
the tonal environment. It may be that in this language the only 
perceptible variations of tone that exist are the result of super¬ 
imposing a sentence intonation on the words. Variations caused in 
this way do not come within the scope of the present enquiry. 

485. Tones groupable into tonemes appear to occur in the 
Somali language. It is instructive to examine from this point of 
view the tone-marked texts in L. E. Armstrong’s The Phonetic 
Structure of Somali. 19 It will be seen there that the values of the 
Somali tones vary according to tonal context. We find for instance 
that the pitch of the first syllable of dameir (donkey, m.) may be 


“ A. N. Tucker, who has an unrivalled knowledge of Tswana, Sotho and 
allied languages, has completed a very detailed investigation of the tonal 
structure of these languages, supplementing and in some respects correcting 
the accounts given by previous workers in this field. He has worked out 
a new tonemic classification which is probably a considerable improvement 
on that suggested here, and he has evolved from it a simple system of tone 
marking well suited for purposes of orthography. For details readers are 
referred to his article Sotho-Nguni Orthography in the Bulletin of the School 
of Oriental and African Studies, Jan., 1949. 

*• See footnote 50 to §580. 
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either lower than or on a level with the beginning of the second 
syllable, i.e. [-\] or [,\] or [' \]. The pitch used would 
appear to depend upon the tones of preceding words. We see 
also that ticjj (she said), which normally has the intonation [ •. ] 
takes or may take the intonation [. . ] after a preceding falling 
tone. 20 Again, a low tone following a high-falling tone sometimes 
rises when a high tone follows, i.e. [ \ / ] occurs as a variant of 
[ \ _ ]. Further research is needed before we can describe precisely 
these tonemes and the variant tones which are comprised in them. 
The facts set out above seem to me, however, sufficient to demon¬ 
strate that the toneme principle will be found to apply in Somali, 
and is likely to elucidate certain features of that language which 
have hitherto been obscure. (Some of the tones may be associated 
with special types of stress. For this, see §§580 ff.) 

486. The principle of the toneme appears to be applicable in 
Serbo-Croat. The tones of the two length-tone complexes of that 
language (§§466, 583) are not always constant, but they vary to 
some extent with the phonetic context. Thus when the word 
written in the Fry-Kostic system ,mla’da (young) is said in isolation 
it is pronounced with the tone-sequence [''•], but when the word 
is immediately followed by a high-pitched syllable, its tone is of 
the type [-• ]. 

Linking of Tones with Other Attributes 

487. It has already been mentioned (§446) that significant 
differences between words are often effected by combinations of 
attributes. This subject is dealt with fully in Chapter XXVI. 
Here we may confine ourselves to remarking that tone may be 
linked to tamber or to duration or to stress. Examples of the first 
are found in Pekingese (§§112, 242, 257, 543), of the second also in 
Pekingese (§557) and of the third in ordinary “stress languages” 
such as English, German, Italian, Russian and Greek (§§570-574), 
and in tonal systems such as those found in Norwegian and 
Swedish (§§575-578). 


ao It is possible that this word is weakly stressed in the cases where it 
occurs in Armstrong’s text No. 2, and that the tone [. . ] is a concomitant 
of the weak stress. 
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CHAPTER XXVI 

DISTINCTIONS BY COMPLEXES OF ATTRIBUTES 

488. Minimal distinctions 1 between possible sound-sequences of 
a given language are sometimes effected by a change in a single 
attribute, such as the substitution of one tamber for another or 
the substitution of a long sound for a short one. The English 
syllables si:n (seen) and sein (sane), for instance, are distinguished 
solely by their vowels; there are no differences perceptible to the 
ear in the s’s or n’s. Likewise the Japanese words taiko (drum) 
and taiko: (opposition) 2 are distinguished solely by the length of 
the vowel. 

489. Such cases are, however, comparatively rare. Minimal 
distinctions much more commonly involve changes in two or more 
attributes simultaneously, e.g. two changes of tamber, the addition 
of a sound together with a change in some other sound, a change of 
length accompanied by a change of tamber, a change of stress in 
combination with a change of voice-pitch. Thus the English 
syllables kirn (keen) and kein (cane) are distinguished not only 
by their vowels but also by the use of perceptibly different varieties 
of k. In Southern English pronunciation 'geili (gaily) differs from 
geil (gale) not only by the addition of i but also by the use of a 
different variety of I (§11). 8 The Sinhalese words si:ta (cold) and 
sita (mind), quoted in §102, are distinguished by the length of the 
first vowel combined with the slight quality difference indicated 
in Figure 4 (p. 29). The English noun 'impo:t (import) is distin¬ 
guished from the verb im'po:t by a difference of stress which is 
ordinarily accompanied by a difference of intonation (see §§457-459). 

490. It is the very fact that most minimal distinctions require 
differences in two or more attributes simultaneously that gives 
rise to the phoneme theory. If minimal distinctions were always 
effected by changes in a single attribute, there would be no phonemes. 

491. When, as is usually the case, a minimal distinction is 
effected by simultaneous changes in two or more attributes, it 

1 See Chapter VI. 

• See §378. 

* The addition or subtraction of a sound is to be regarded as a particular 
case of change of sound—the replacement of a sound by zero or vice versa. 
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generally happens that the change in one attribute greatly pre¬ 
dominates over the change (s) in the other (s). The change in one 
attribute is in fact the “essential” or "primary” one, while the 
change(s) in the other(s) may be considered as “incidental to” or 
“conditioned by” the primary change. It is evident, for instance, 
that such English syllables as kirn, kein and ko:n [corn) are 
distinguished essentially by their vowels, and that the accompanying 
differences between the k’s is merely incidental to the vowel 
distinction. Similarly the Sinhalese words si:ta and sita are 
differentiated essentially by the length of the first vowel; the slight 
difference of tamber accompanying this is incidental and compara¬ 
tively unimportant. 

492. The examples of phonemes given in the preceding chapters 
have for the most part illustrated cases where there is no doubt 
as to which change of attribute is the primary one. Cases exist, 
however, where the change in one attribute does not predominate 
in such a way as to leave no doubt that it is the primary means of 
differentiation. Cases indeed occur in which the differentiation is 
distributed between two or even among three attributes in such a 
manner as to render it difficult to say which takes the greater 
share in effecting the distinction. 

493. It is easy to understand why this may be so, since what 
is essential in language is that the total acoustic difference between 
one word and another should be adequate. It does not matter 
by what means a sufficient acoustic distinction is arrived at. It 
may be produced by merely substituting one sound for another 
or one degree of length for another, as mentioned in §488. Or it 
may be produced by a considerable change in one attribute accom¬ 
panied by relatively small changes in others, as mentioned in §489. 
But equally effective acoustic distinctions can be made by means 
of slighter differences in two or three attributes—differences of 
such a nature that no one of them would be of itself adequate to 
distinguish clearly one word from another. 

494. Thus if there is only a moderate duration difference between 
two words of a language, clear differentiation between them necessi¬ 
tates coupling with it a difference in some other attribute, usually 
tamber. The smaller the duration difference the greater must be 
the difference in tamber or other attribute. And a point can be 
reached at which the relative differentiation value of the differences 
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of duration and tamber are approximately equal—where, in fact, 
it is impossible to regard the one as primary and the other incidental. 

495. The principle is well illustrated by comparing the usage 
in Sinhalese with that in Southern English. In Sinhalese, as 
already mentioned in §§102, 489, 491, the differences of vowel 
duration are considerable and the accompanying differences of 
tamber are slight. In Southern English, on the other hand, the 
differences of vowel duration are less well marked so that the 
requirements of distinctness necessitate wider tamber differences 
(see §§511 ff„ also §§402 ff.). 

496. When a minimal distinction is effected by a primary 
change in one attribute accompanied by incidental changes in 
another attribute, the only feasible phonemic grouping is that 
made on the basis of the primary attribute and without reference 
to the secondary attribute. Thus as the differences between the 
k’s in the English words ki:n, kein, ko:n are evidently incidental 
to the differences between the vowels, these different k’s must 
be held to belong to the same phoneme. In the unusual cases where 
the differences in two attributes are of about equal value, it is 
possible to make an adequate phonemic grouping in more than one 
way (see for instance §§514r-516). 

497. The subject of minimal distinctions by complexes of 
attributes has been touched upon in various places in preceding 
chapters. It is now necessary to discuss this matter in greater 
detail. 

498. The following combinations of two attributes are capable 
of being used for minimal significant distinctions, lhere are 
doubtless others: 

(1) a tamber combined with another tamber in an adjacent 
or a neighbouring position, 

(2) tamber combined with length of the sound in question, 

(3) tamber combined with length of an adjoining sound, 

(4) tamber combined with a degree of stress, 

(5) tamber combined with a particular kind of stress, 

(6) tamber combined with voice-pitch, 

(7) the duration of a sound combined with the duration of an 
adjacent sound, 
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(8) duration combined with a degree of stress, 

(9) duration combined with a particular kind of stress, 

(10) duration combined with voice-pitch, 

(11) a degree of stress combined with a degree of stress on an 
adjacent syllable, 

(12) degrees of stress combined with voice-pitch (intonation), 

(13) particular kinds of stress combined with voice-pitch, 

(14) voice-pitch combined with voice-pitch on a neighbouring 
syllable or syllables, 

(15) voice-pitch combined with voice-quality. 

499. In the following pages examples are given of each of these 
cases in the above order 

(1) Distinctions by Two Adjacent or Neighbouring 
Tambers 

500. As has already been pointed out, many minimal distinctions 
are effected by a major change of tamber accompanied by a minor 
change of an adjacent tamber, and English syllables like ki:n,. 
kein and ko:n (corn) have been adduced as instances of this. Further 
instances are seen in the French words mjti (mien) and tje (tien)\ 
the essential difference between them lies in the initial consonants, 
but there are also differences between the two j’s, the j of mj£ 
being fully voiced and that of tje being partially breathed. And 
so on. 

501. Sometimes, however, the substitution of one tamber for 
another is accompanied by a change in an adjacent sound which 
cannot be regarded as a "minor” one. When a sequence is such 
that an alteration for a minimal distinction involves major changes 
in two neighbouring sounds, it may be termed a "linked sequence.” 
There is, of course, no hard and fast line to be drawn between 
major and minor changes. In the following paragraphs, however, 
we give some instances where it is reasonable to regard both elements 
of a change as being major alterations. 

502. [nt] and [gk] are linked sequences in Spanish; the distinc¬ 
tion t/k is evidently the principal one, but the concurrent change 
n/g is also considerable. As already shown in §92, [an] and [ag] 
are linked sequences in Danish; the differences in both elements 
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are considerable, so much so that the difference a/a is sometimes 
the preponderating one to the ear, though the phonemic distinction 
is undoubtedly n/g (§§181, 182). 

503. The following are instances of linked sequences occurring 
in Pekingese. That language contains pairs of “finals" which may 
be transcribed narrowly as -Ag, -an and -ag, -an. -Ag and 
-an occur, for instance, in [~fAg] (wind) and [Tan] (separate), 
and there are no such syllables as [ - fAn] or [Tag]. Notable as 
these vowel differences are, we find that the difference n/g must 
always be considered the primary one. For in Pekingese there 
is always a definite line of demarcation between the sounds n and 
g, whereas there is no definite line of demarcation between each 
pair of vowels. 4 Besides which in the case of the third analogous 
pair of finals -ig, -in the difference of vowel quality is insignificant. 
Broad transcriptions of the above finals would therefore be 
-en, -eg, -an, -ag. 

504. Russian syllables such as [bji] and [bi] are also linked 
sequences, since the opposite sequences [bi] (i.e. "hard” b followed 
by i) and [bji] (i.e. "soft” b followed by i) do not occur in the 
language. The distinction between the existing syllables is effected 
by a change of consonant (bj'/b) with a concurrent change of vowel 
('/+)• 

505. Attention has already been called to this case in §97, and 
reasons why sounds like bj and b must constitute separate phonemes 
were given in §95. We may add here that to determine which 
change is the primary one, in comparing two linked sequences, 
it is sometimes needful to examine what happens in other sequences. 
A foreign observer hearing the Russian syllables [bji] and [bi] 
for the first time would, I think, be fairly certain to take the striking 
difference of vowel to be the primary means of distinction. And if 
he were to see the Russian orthographic forms (6n, 6bi), he would 
regard them as lending support to this view. An investigation 
of all the circumstances leads, however, to the conclusion that the 
consonants should be regarded as the primary distinguishing 
elements, and that the differences of vowel, wide though they may 
often be, are secondary. We can see this from the treatment of 

1 Their qualities vary to some extent with the tones. Thus, the a of an 
is more A-like when the syllable has the third tone than when it has other 
tones. 
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analogous sequences with other vowels [bje] and [be], [bja] and 
[ba], etc. In the case of these other vowels numerous shades 
are to be found in different connexions, shades which merge into 
one another by imperceptible degrees. There are no sharp lines 
of demarcation which would render it possible to divide each set 
of vowels into two phonemes. Each set of vowels, i.e. e, e and 
intermediates, a*, a, a and intermediates, etc., must therefore con¬ 
stitute a single phoneme, and the above-mentioned distinctions 
must be attributed primarily to the consonants bj and b. bj and 
b must in fact be considered as separate phonemes, and any vowel 
distinctions which may accompany them must be regarded as 
secondary. If this applies to [bje], [be] and to [bja] and [ba], 
we cannot do otherwise than apply it also to [bji] and [bt] in spite 
of the particularly wide difference of vowel sound found in this case. 
The Russian sounds i and t must thus be held to be members of a 
single phoneme. This fact was known to Baudouin de Courtenay 
(the discoverer of the phoneme theory) long before 1900. It was 
stated again much later by Trubetzkoy (Grundzuge der Phonologie, 
p. 48, in Cantineau’s translation, p. 54), by Bloomfield, and by 
Boyanus in his paper The i - t Phoneme. 

506. The best broad transcription of the syllables [bji] and [bt] 
is thus bji, bi. The notation bi (representing bji), bt would also 
distinguish these particular syllables adequately, but would involve 
an inconsistent use of the letter b. 

507. Languages exhibiting the phenomenon known as "vowel 
harmony” also illustrate the.principle with which we are concerned 
here. In that type of language the change of a vowel in one syllable 
involves a simultaneous change of vowel in an adjoining syllable. 

508. Vowel harmony is found, for instance, in Telugu. It 
may be illustrated by such sequences as [atti], [atta], [ottu]. The 
sounds a and a are dependent upon phonetic context, and their 
use is confined to situations like those just mentioned—a when 
the following syllable contains i, and a when the following syllable 
contains u. There are no sequences [atta], [attu], [atti], [atta], 
nor do a or a front a occur in monosyllabic words, i, a and u, on the 
other hand, are independent sounds of the language, and occur in 
monosyllabic words. It is therefore clear that they may be said 
to “constitute” phonemes and are the primary sounds in the above 
linked sequences, a and a cannot be otherwise than subsidiary 
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sounds, and must evidently be assigned to the a-phoneme (which 
would normally be written a). 

509. Interesting instances of vowel harmony involving sequences 
of an unusual kind are found in the Igbo language of Nigeria. This 
language presents the peculiarity that certain pairs of vowels are 
difficult to distinguish in isolation, but their recognition is facilitated 
in a great many cases by the nature of the vowels used in adjoining 
syllables. 5 6 The vowels of Igbo fall in fact into two classes and as a 
rule only vowels belonging to the same class occur in a given word. 
The first class comprises the sounds i, £, o and u, while the second 
class comprises e, a, a and o.* i and e are rather near to each 
other in sound (the i being a not particularly close one), as also 
are u and o. But in words of more than one syllable the distinctions 
are helped out by the fact that if i or u occurs in a word, the other 
syllables of that word can only contain i, e, o or u; and if e or o 
occurs in a word, the other syllables generally contain e, a, d or a. 
For example, there are words izu [' • ] (stealing), izo [' • ] (hiding) 
and ezo [' • ] (buying), 7 but combinations such as izo, ezu do 
not occur. The verbal root zo [ ‘ ] (buy) is intermediate in 
sound between the verbal roots zo [ ’ ] (hide) and zu [' ] (steal) 
and is liable to get confused with them if said in isolation; but 
when the verbal noun prefix (which is i- or e- according to the 
root vowel of the verb) is added, it becomes quite clear in each 
case which verb is meant. 

(2) Distinctions by Tamber Combined with Duration 
(a) Vowels 

510. Distinctions by a complex of tamber and duration of 
vowels are very common. Examples were given in §§104-106. 
Here we will examine the question at greater length. 

511. In ordinary Southern English we find, as has already 
been pointed out, several cases where vowels of different quality 
are connected by a length relationship. Thus, in the type of English 
I use, the sounds written in broad transcription i: and i differ 
both in duration and quality. These attributes are subject to the 
relationship that the longer sound always has the "closer” quality 

5 This was first discovered by Professor Ida Ward. I am indebted to 
her for the particulars given here. 

* Q denotes a sound intermediate between o and u. It is written (j in the 
new official orthography. 

7 Ward, Introduction to the Ibo Language, p. 3. 
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in a given phonetic context. So that if the stress and intonation 
conditions are constant, 

the closer vowel in deed, is longer than the opener vowel in did, 
the closer vowel in seen is longer than the opener vowel in sin, 
the closer vowel in sheep is longer than .the opener vowel in ship, 
the closer vowel in the first syllable of aesthetic (i:s'9etik) is longer 
than the opener vowel in the first syllable of estate (is'teit). 

512. The pairs of vowels in caught and cot, pool and pull are 
in the same case. The vowels have quite distinct qualities, but 
in my type of English these qualities are connected by the relation 
that one is always longer than the other in similar phonetic contexts. 
The first vowel in naughty is, for instance, longer than that in 
knotty, and the vowel in hoot is longer than that in foot. 

513. A similar length relationship between the vowels in psalm 
and Sam, halve and have, etc., is also to be found in Southern English 
speech. 8 

514. It is not always easy to decide what is the best manner of 
making the phonemic classification in cases such as the above. 
There are, as was indicated in §§492-494, border-line cases where 
the distinction between sound-sequences appears to be about 
equally divided between the length and the tamber. 

615. The above-mentioned distinctions found in my type of 
Southern English are near to this border-line. They are, however, 
cases where in my view the length difference may be considered 
to preponderate, as already mentioned in §104. This is the 
traditional way of regarding the sounds in question: the vowel 
in deed is generally considered to be the “long” of the vowel in 
did, and the vowels in caught, pool, psalm are usually thought of 
as being the “longs" of the vowels in cot, pull, Sam. In other 
words, the traditional, and to my mind the most practical, method 
of grouping these vowels phonemically is to assign the qualities 


* With many speakers (including myself) the relationship referred to in this 
paragraph does not hold good. This is because we often lengthen the tradi¬ 
tionally short a in a number of words, e.g. man, bad, bag, grand. With these 
speakers the two vowel qualities belong to separate phonemes, and must be 
written in broad transcription with separate letters (a and a, or a and E or 
36). Length need not be marked except in recording the speech of those— 
and there are apparently some—who make chronemic use of length on the 
front a (se). (See Outline of English Phonetics, §§874, 875.) 
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of these corresponding longs and shorts to single phonemes. On 
this basis the sounds would be written broadly with single letters 
with and without length-marks: i:, i, o:, o (or ax, a), u:, u, a:, a. 

516. Being border-line cases, it would equally be possible to 
regard the tambers as constituting the primary differences, and the 
differences of length as incidental. Looked at thus, the tambers 
would all be assigned to separate phonemes, and would be repre¬ 
sented in broad transcription by separate letters. The lengths 
being then incidental would not be marked in broad transcription: 
we should write the sounds by some such system as i, i, o, a, u, 
a, a, a (or ae, a for the last two). 

517. In border-line cases analogy with the treatment of another 
sound may be suggestive. Thus the treatment of the Southern 
English long vowel in bird may be taken into consideration in 
coming to a conclusion as to the best phonemic grouping of the 
other English vowels just mentioned. It has been pointed out 
(§§197-202) that the vowel of bird is best assigned to the same 
phoneme as short a, since its quality is practically identical with 
one of the varieties of short a . 9 The fact that a: and a are best 
treated as here described is not necessarily a reason for treating 
other pairs of vowels on similar lines. If, however, the other 
pairs are susceptible of phonemic grouping in more than one way, 
it is clearly advantageous to select the system which gives con¬ 
sistency of treatment to all the pairs, namely, the grouping suggested 
in §515. 

518. Symbol economy in transcription is also a principle to 
which consideration may be given in border-line cases. It points 
in favour of the grouping of the above vowels as corresponding 
longs and shorts, as indicated in §515. 

519. The above-mentioned relationships between the duration 
and tainber of various vowels are found not only in normal Southern 
English, but also in the English of Northern England. 10 It must 
be observed, however, that there are other varieties of English in 
which this traditional relationship does not hold good. Notable 

9 Outline of English Phonetics, §356. 

10 Except that words like psalm and Sam are not distinguished in the 
speech of many Northerners. They use an intermediate short a in both words. 
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among these is one type of Southern speech, in which the traditionally 
short vowels are in some phonetic contexts lengthened and made 
as long as the traditionally long vowels. 11 Those who speak in this 
way will pronounce such words as six, lock, good in isolation with 
long vowels, and there is no perceptible difference in length between 
these and their vowels in cease, walk, food. The distinctions are 
by tamber only. These vowel-qualities must in this type of speech 
be all assigned to separate phonemes, and must be written in broad 
transcription with separate letters without length-marks, e.g. by a 
system such as that suggested in §616: the above words would be 
written siks, bk, god, sis, wok, fud, with the convention that all 
the vowels are or may be long. 

620. The relationship between the traditionally long and short 
vowels likewise does not hold good in Scottish English, though for 
different reasons. Scottish speakers generally shorten the tradi¬ 
tionally long vowels when a consonant follows. They pronounce 
week with a short close i and distinguish this word from wick by 
tamber only. Pool and pull are not distinguished at all, but 
both are pronounced with a short close u. 12 Similarly caught 
and cot are not generally distinguished in Scotland, but are both 
pronounced with a short vowel (about Cardinal Vowel No. 6). 13 
These words would be written broadly wik, wik, pul, kot, with 
the convention that the vowels are all short. 

521. In American English too there appear to be as a rule no 
consistent relationships between the lengths and qualities of vowels. 
It is therefore necessary to regard the qualities corresponding to the 


11 This type of Southern English is, I believe, a modem development, 
since it is heard chiefly from younger people. There is a possibility that it 
may some day supersede what may be called, for want of a better term, the 
"normal" Southern English manner of pronouncing, though I am disposed 
to think that the influence of Northern and other forms of English will prevent 
it from doing so—unless, as is possible, American speech, which shows similar 
characteristics, comes to have a preponderating influence. 

11 Generally a rather forward variety (ii). 

ls It would appear, however, from some books dealing with the phonetics of 
Scottish speech that some Scotsmen make a quality difference between the 
vowels of such words as cot and caught. The letter Q has been used to denote 
the vowel in caught by those who differentiate this word from cot. See Grant 
and Robson, Phonetics for Scottish Students and the Phonetic Reader for 
infants, which was published by the Bon Accord Press, Aberdeen, in 1918. 
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traditionally long and short vowels as belonging to separate 
phonemes, and to write them with separate letters in broad trans¬ 
cription, thus i, l; e, e; o, d; u, q, etc. 14 

522. In a common variety of Scottish pronunciation 15 (referred 
to in my Pronunciation of English, 4th edition, Cambridge, 1958, 
§§180, 181) there exists a pair of diphthongs exemplifying a complex 
of duration and quality. These two sounds are represented 
phonetically in The Pronunciation of English by the symbols Ai 
and ae. ae is always longer than Ai, as well as having very distinct 
qualities; their features, both of length and of quality are illustrated 
in Fig. 18 on p. 58 of that book. What is particularly remarkable 
about these diphthongs is that they constitute separate phonemes 
in this Scottish pronunciation, while they both belong to the ai- 
diaphone of English as a whole. All the Southern English ai’s are 
rendered in Scottish by one or other of these diphthongs. This is 
illustrated for instance by the fact that the words tide and tied are 
distinguished as tAid and taed in the Scottish pronunciation, while 
in Southern English both words are pronounced taid. 

523. Combinations of length differences with wide differences 
of quality are found in Dutch and in Hungarian. Dutch contains, 
for instance, close e and o which are always relatively long and 
open e and d which are always relatively short, a front a which is 
always relatively long and a very back a which is always relatively 
short. Hungarian has also several pairs similarly distinguished, 
e.g. there is a front a which is always long and a very back a which 
is always short. 1 * 

524. It is convenient, for purposes of symbol economy if not 
for any other reason, to consider the length as the primary distinction 
in these pairs and the tamber differences as incidental. This 

11 Or, indicating the slightly diphthongal character of the sounds derived 
from the traditionally long closer vowels: ij, i; ej (or ei); OW (or ou), O; 
UW, U. This manner of using j and w is recommended by Bloomfield, 
who however uses D for what is here written O, and o for A. 

15 The main particulars of this type of Scottish pronunciation were first 
brought to my notice by J. Braid wood (a Scot) who is Lecturer in the 
Department of English, University of Belfast. I am indebted for other details 
to David Abercrombie, Head of the Department of Phonetics in the University 
of Edinburgh. 

M The Hungarian back a is almost identical with the Southern English short 
O-sound in hot. 
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indeed is the traditional way of regarding the sounds, and it is 
reflected in the common orthography. In broad transcriptions we 
therefore mark the length and we write Dutch e with e and o with o, 
and the Dutch and Hungarian back a-sounds with a. Examples 
are Dutch be:n {been, leg), ben 17 (ben, am), ro:s ( roos , rose), ros 18 
(ros, horse), ta:k ( taak , task), tak 19 (tak, bough); Hungarian 'adorn 
(dlom, dream), 'halom 20 ( halom , heap), va:d (vdd, accusation), vad 21 
(vad, wild animal). 

525. Several pairs of German 22 vowels are likewise distinguished 
by complexes of length and tamber, though except for the e and o 
sounds the tamber differences are less than those of English, Dutch 
or Hungarian. The length-differences are as a rule greater than the 
corresponding differences in English. This is as one would expect 
(see §§492-496). There are six such pairs, and I see no adequate 
reason for departing from the traditional view that the sounds 
should be considered as pairs of corresponding longs and shorts. 
Classifying them thus the sounds can be written in broad trans¬ 
cription by six letters with and without a length-mark. Examples 
are Jti:l ( Stiel ), Jtil (still), 'biitet ( bietet), 1 bitet 23 (bittet), 'i:nen 
( ihnen), 'inen 24 (innen), 'ha:ken ( Haken ), 'haken (hackeri), 'o:fen‘ 
(Ofen), 'ofen (offen), 25 mu:s (Muss), mus 28 (muss), 'fy:len (fuhlen), 
'fylen 27 (fallen), 'hade (Hohle), 'hale 28 (Holle). 

526. It would be possible to treat the tambers as the primary 
means of differentiation in such pairs of German words, as also 
in similar pairs of Dutch and Hungarian words, and to treat the 
length differences as incidental. Classifying on this principle the 


17 Narrower transcription ben. 

18 Narrower transcription os. 

18 Narrower transcription tak. 

20 Narrower transcription 'halom. 

21 Narrower transcription vad. 

22 Stage pronunciation, as described by Vi£tor ( Die Aussprache des 
Schriftdeutschen, etc.). 

23 Narrower transcription 'bltat. 

24 Narrower transcription 'Pinan. 

25 Narrower transcription 'Pofan. 

28 Narrower transcription mos. 

27 Narrower transcription 'fvlan. 

28 Narrower transcription 'hoela. 
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above twelve German tambers would all be counted as separate 
phonemes. Twelve letters would be required to write them in 
broad transcription, and the length-mark would be superfluous. 
From the point of view of writing and printing as well as on 
linguistic grounds, such a system would, in my view, be less satis¬ 
factory than the other. 

527. Differentiations of vowels by complexes of duration and 
quality are found to a limited extent in “monosyllabic” languages 
of the Far East. Thus in Cantonese, in Thai (Siamese), and in 
Shan we find a vowel resembling the English a: which is always 
long and a vowel resembling the English a which is always short. 
In most of these cases it seems convenient to treat the duration 
as the primary distinguishing element and the difference in tamber 
as secondary. In broad transcription we should therefore write 
the a as a. Examples are: Cantonese s sa:m (three), s sam 29 (heart); 
Thai 'sa:n (to weave), 'san 30 (sharp edge), _a:p (to bath), _ap 31 
(stuffy); Shan "main (pregnant), "man 32 (his), _ma:n (glass, be 
correct), .man 33 (hold tight). 

528. In Vietnamese, however, where a similar distinction is 
found, it seems better to mark the differences by using separate 
symbols, treating the length as incidental. For this, see the notes 
on the phonetics of Vietnamese by E. Henderson in Le Maitre 
Phonetique, January, 1943, pp. 7, 8. 

529. There are many other languages besides those above 
referred to in which vowels are differentiated by complexes of 
duration and tamber. In fact where essential differences of length 
exist, these differences would appear to be more often than not 
accompanied by quality differences. Even in the languages which 
are reputed to distinguish words by duration alone we very frequently 
find some accompanying differences of quality. Sinhalese is a case 
in point, as we have already seen (§§102, 103). Another is Estonian. 
This latter language contains nine vowels, all of which can be 
long or short. Close observation shows that in six of these (i, e, 
o, u, y and s) slight differences of quality accompany the differences 

29 Narrower transcription x SAm. 92 Narrower transcription ~mAn. 

30 Narrower transcription ^SAn. 33 Narrower transcription _mAn. 

31 Narrower transcription _Ap. 
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of duration; in only three cases (e, a and 0 ) are the qualities of short 
and long vowels indistinguishable. 34 There even appears to be a 
slight difference of quality between the French long and short £ 
in the pronunciation of many French people. 

(b) Consonants 

530. Differentiations by complexes of tamber and duration in 
consonants are comparatively rare. They form, however, a 
conspicuous feature of Tamil. In that language long voiceless 
plosives occur in intervocalic positions, but the corresponding 
short sounds in similar positions are in the case of p and t fricative 
and voiced (p, 8), in the case of t a voiced flap (cj. or [), and in the 
case of k a fricative (a variety of x, sometimes also voiced, y). Thus 
there are words which would be written in narrow transcription 
teppam (raft), kepi (cave), katti (knife), ka3i (heavenly state), 
kaiftu (show), kaxju (jungle), and, as already mentioned in §86, 
pakkam (side), maxan or mayan (son), ajugi (decaying), p, 8, x 
are never lengthened. Nor are medial p, t, t, k ever quite short; 
they are at times shorter than completely doubled consonants, but 
are never shortened to such an extent as to render it impossible 
to regard them as on the whole long. 

531. As with other complexes, one may either take the length 
or the tamber to be the primary distinguishing feature. It is 
convenient on practical grounds to regard the length as primary 
and the tambers as subsidiary in this language, since this plan 
effects an economy of several letters in writing. In broad trans¬ 
cription kepi, ka3i, kaicju, maxan, ajugi would thus be written 
kepi, kati, kaifu, makan, ajuki. 35 

(3) Distinctions by Tamber Combined with Length of 
an Adjoining Sound 

532. Differentiation of sequences by two rather similar sounds 
together with a difference in length of an adjoining sound was 
exemplified in §§187-191. It was shown there that in common 
types of English the somewhat similar pairs of sounds t and d, 

34 See L. Krass, The Phonetics of Estonian. 

36 Most of these particulars and examples are taken from J. R. Firth’s 
Short Outline of Tamil Pronunciation to which readers are referred for further 
information. 



174 THE PHONEME: ITS NATURE AND USE 

s and z, etc., are used for differentiating words, but that when 
they are in terminal position the distinction is strengthened by a 
difference in the length of the preceding sound. When the final 
sound is d, §, z, etc., a preceding vowel is much longer than when 
the final sound is t, k, s, etc. Pairs of words such as si:d (seed) 
and si;t (seat), roud (road) and rout (wrote), are thus distinguished 
by a complex of two attributes: the tamber of the final consonant 
and the length of the preceding vowel. 

533. Cases like lend and lent, paind (pined) and paint (pint), 
penz (pens) and pens (pence) are similar, except that the length is 
spread over the vowel and the penultimate consonant; it generally 
affects the consonant more than the vowel (§191). 

534. In all these cases it is necessary, as already indicated in 
§§190-191, to regard the tamber as the primary distinguishing 
element and the length as secondary, since there are cases where 
differentiations are made by these sounds without the aid of 
length. 

(4) Distinctions by Tamber combined with a Degree 
of Stress 

535. Often differences of stress are not accompanied by any 
perceptible quality differences. Thus there are no appreciable 
differences between the qualities of the vowels of the strong and 
weak syllables of such English words as 'abstrakt (abstract, adj.), 
'hAmbAg (humbug) or between the vowel in sekt (sect) and the 
weakly stressed vowel in 'insekt (insect). 

536. There is, however, at least one important language, namely 
Russian, in which differences of stress regularly involve differences 
of tamber. There are, for instance, varieties of i occurring ex¬ 
clusively under weak or medium stress, while the use of close 
i-sounds is confined to strongly stressed positions (see §§332, 333). 
Again, the sound which we may call the “normal” Russian a, as 
in da (aa, yes) when said in isolation, is used only in strongly stressed 
positions. Its place is taken in medium and weakly stressed 
positions by sounds of the a and a types. In particular, a is used 
to the exclusion of a and a (1) in pre-tonic positions, 36 (2) in pre-pre- 


36 I.e. in syllables immediately preceding the strongly stressed syllable of a 
word of more than one syllable. 
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tonic positions 37 when initial in the word, and (3) in certain pre- 
pre-tonic sequences of vowels. 370 a on the other hand is employed 
to the exclusion of a and a in all post-tonic positions, and in pre- 
pre-tonic positions other than those appropriate for a (3 above). 
The cases are illustrated by the following examples: [vA'da] (boas, 
water), [dA'vatj] (flaBaTb, give), [kAr'man] (napMaH, pocket); 
[apjilj'sjin] (aiaejibCHH, orange); [nAu'cJatj] (HayaaTb, to teach); 
[kAApji'ratsija] (Koonepaima, co-operation); [xarA'Jo] (xopoiuo, 
well); [bbgsdA'rjitj] (bjxaroAapnTb, thank); [‘komnata] (noMHaTa 
room); [‘jablaka] (hSjioko, apple); ['gorat] (ropoA). 


537. Similar considerations apply to the other Russian vowels. 
Thus an obscure variety of u [o] occurs to the exclusion of normal 
u in syllables that do not bear strong stress: for instance, the 
medium stressed u in bu'maga [bo'maga] (6yMara, paper) differs 
in quality from the strongly stressed u in 'dumatj ['dumatj] (AyMaTb, 
think). 

538. Although in Russian differences of stress are accompanied 
by differences of tamber, it is clear that the stress must always be 
regarded as the primary means of differentiation and that the 
differences of tamber are incidental. The reason is that the tamber 
differences are numerous and some of the shades differ but slightly 
from each other; the qualities of the weakly stressed vowels do not 
in fact fall into sufficiently definite families to make it practicable 
to take them as a primary means of differentiating sound sequences. 


539. It is clear therefore that the Russian sounds a, a and a 
belong to a single phoneme. This being so, appropriate broad 
transcriptions of the examples in §536 are va'da, da'vatj, kar'man; 
apjilj'sjin; nau'tjatj, kaapji'ratsija; xara'Jo, blagada'rjitj, ‘komnata, 
'jablaka, 'gorat. Similarly with other Russian vowels. 


37 I.e. in syllables prior to the pre-tonic (at any distance). 

370 These are listed by D. Ward in his Russian Pronunciation, pp. 20, 21. 
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540. The same manner of distinguishing sequences is exemplified 
to a limited extent in Italian and Spanish. It is noteworthy for 
instance in Italian that a final weakly stressed o always has an 
"opener” value than o in other situations (see §210). The two 
o’s in 'dopo, for instance, have distinctly different qualities. And 
in both Italian and in Spanish weakly stressed a in final position 
has a somewhat more obscure (e-like) quality than the strongly 
stressed a’s. 

541. It is appropriate to add a word here on the subject of the 
use of e in English, supplementing what was said in §337. Although 
this sound only occurs in weak syllables, its use does not illustrate 
the principle with which this section is concerned; for, in my opinion, 
it does not occur in weak syllables to the exclusion of other vowels. 
We find s, it is true, "representing” various other vowels in 
■foutagraf, fa'tografi, |fouta'grafik, 'memari, ma'mo:rTal, 38 'aksidant 
laksi'dentl, 'regjula, (regju'lariti, etc. But with these must be 
compared such cases as ‘deskant (cp. 'distant), 'kontrast (cp. 
'brekfast), 'pi:kok (cp. 'hadak), and the examples in §535, in which, 
in my view, e, a, o and a have weak stress, and which therefore 
show that a constitutes a phoneme separate from a, o, a, e, etc. 
(see also §148). (For another view, see §204.) 

(5) Distinctions by Tamber combined with Type 
of Stress 

542. Distinctions by this combination are very uncommon. 
They have, however, been observed by G. L. Trager in a Croatian 
dialect, where it appears that certain vowel qualities (e.g. a some¬ 
what diphthongal e: £ ) are associated with diminuendo stress and 
certain other vowel qualities (e.g. pure e:) are associated with 
crescendo stress. (See the article by him on the Serbo-Croatian 
dialect of Svirde in Le Maitre Phonetique, January, 1940, pp. 14,15.) 

(6) Distinctions by Tamber combined with Voice-Pitch 

543. Distinctions of this nature have been already noted as 
existing in Pekingese, and they are doubtless to be found in other 
types of Chinese. Examples have been given in §§112, 113 and 257. 
Another is that in Pekingese the triphthong which may be written 


88 Also mi'moirial. 
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iau varies with the four tones. In particular, when said with the 
third (low-rising) tone it has a very open value, which might be 
written narrowly eao or even eao. (A difference of duration is 
also involved. See §557.) 

544. Differentiation by tamber combined with voice-pitch is, 
however, not an essential characteristic of all monosyllabic tone 
languages by any means. It does not appear to be found, for 
instance, in Cantonese, Siamese, Shan or Burmese. 

545. When distinctions are effected by tamber plus voice- 
pitch (tone), it seems clear that the tone is always the predominating 
element and the tamber subsidiary. That this must be the case 
in Pekingese is shown by the fact that in the case of many of the 
syllables of that language there is little or no variation of tamber 
accompanying changes of tone. Thus the syllables “i, / i, / and N i 
show practically no differences of tamber. Furthermore, in cases 
where there are differences of quality I think it is correct to say 
that as long as the words are pronounced with proper tones they 
remain intelligible even though the appropriate variations of tamber 
are not observed; whereas they would not be intelligible if pro¬ 
nounced with exact vowel tambers and incorrect tones. 

546. Differences of tamber such as those mentioned in §543 
would therefore not be shown by differences of symbol in broad 
transcriptions of Pekingese. (See, however, footnote 15 to §257.) 

547. Slight differences of tamber accompany differences of 
tone in Serbo-Croat. (See Fry and Kostic, Serbo-Croat Phonetic 
Reader, pp. 7 and 8, and the diagram on p. 9 of that book. Here 
again the differences of tamber are evidently entirely subsidiary 
to the differences of tone.) 

(7) Distinctions by Duration of a Sound combined 
with Duration of an adjoining Sound 

548. In Chapter XXII (§376, etc.) mention was made of the 
well-known fact that sequences are often distinguished from one 
another solely by the duration of a single sound. We must now 
call attention to another not uncommon use of duration, namely, 
the distinction of sequences by the relative lengths of two or more 
adjacent sounds. 
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549. This manner of using length is found to a limited extent 
in French. The pair of words bst ( bette ) and belt (bete) differs 
chiefly in the length of the vowel, but this difference is accompanied 
by a compensatory difference in the length of the final consonants. 
The t of bet is noticeably longer than that of belt. (See 
§371.) 

550. A more extensive use of the same principle forms a 
characteristic feature of Italian. The usage in that language is 
that when a strongly stressed vowel 39 is long the following 
consonant is short, and that when a strongly stressed vowel is 
short the following consonant is long (“double”). This is exempli¬ 
fied in such pairs of words as 'karo 40 (ca.ro, dear), 'karro ( carro, 
waggon), 'vano 40 (vain), 'vanno (they go), 'seta (silk), 'setta (sect), 

1 belo 40 (belo, bleating), 'bello (hello, beautiful), 'note 40 (notes), 'notte 
(night). 

551. The principle extends to groups formed of a vowel + two 
consonants. If the vowel is long the consonant group is short 
and vice versa. When the consonant-group is long, the length 
either affects one of the consonants only, or is spread over the two, 
according to circumstances. Thus the group dr in 'padre is short, 
while in kwattro, tanto, molto, the consonant groups are long. 
In these three latter words it is the t that takes the length. This 
has to be shown by doubling the t in the phonetic transcription of 
'kwattro, since writing 'kwatro would indicate a lengthening of the 
a (-a:tro being a possible Italian combination). It is, however, 
not necessary to write tantto, moltto, since -a:nto, -o:lto are not 
possible Italian combinations. 

552. The same principle is followed in Swedish and doubtless 
in many other chrone languages. Illustrations from Swedish are 
kail 41 (kal, bald), kail (kali, cold), ta:k 41 (tak, roof), takk (tack, 
thank you), v ka:la 41 (kala, plural of kal), v kalla (kalla, plural of 
kali), a:j; 41 (art, sort), svaft (svart, black), he:t 41 (het, hot, masc. 


38 The conditions apply only in the case of strongly stressed vowels. Weakly 
stressed vowels are always short in Italian, but they may be followed by 
long or by short consonants. Compare ko'nojjere (conoscere), kon'nettere 
(i connettere ), ka'mino (carnino, chimney), kam'mino ( cammino , I walk). 

40 In narrower transcription the stressed vowel would be marked with 
the sign of length: 'ka:ro, 'vaino, etc. 
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and fem.), hett {hett, hot, neut.), V.ra 41 (dra, to honour), v herra 
(herre, master). 

553. In languages exhibiting this peculiarity of pronunciation 
on a considerable scale it is probably best to consider the chronemes 
as being applicable to the consonants only, the differences of vowel 
length being regarded as incidental to the consonant lengths. 
This is in fact the principle followed in the common orthography 
of Italian and (with some exceptions) in Swedish. The alternative 
plan of applying the chroneme to the vowels would not be practicable, 
at any rate for Italian, on account of the fact that in that language 
the lengthening of consonants in pretonic positions is not linked to 
any marked extent with shortness of the preceding vowel. 

(8) Distinctions by Duration combined with Degree 
of Stress 

554. It sometimes happens in stress languages that strong 
stress is accompanied by longer duration than weak stress. 
Examples of this have already been given in §§393, 394. In some 
of the cases observable in English the difference of duration between 
strong and weak syllables is only slight; there is, for instance, 
little if any difference of length between the corresponding vowels 
of the words 'bilou ( billow) and bi'lou (below), or those of 'inkriis 
and in'kriis. Where, however, the long vowel is in a weakly 
stressed syllable immediately preceding a strong stress, the difference 
in length is greater; it may be noticed, for instance, in comparing 
'toiment ( torment , n.) with to : 1 merit (v.) or in the examples in 
§393. 

555. Italian, on the other hand, furnishes very clear examples 
of the association of length with stress. In that language the 
strongly stressed syllables are normally much longer, either in 
vowel or in consonant (§§550, 551), than the weakly stressed 
syllables: thus the second syllable of pa'tata (potato) has a vowel 
considerably longer than either of the others, while the second 


41 It is not certain whether the vowel length-mark can be dispensed with 
in broad transcription of Swedish, or whether a can be written a without 
giving rise to ambiguity. A decision on these points can only be come to 
after an investigation of the lengths and qualities of vowels in weakly stressed 
syllables. 
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syllable of saranno (they will be) is a good deal longer than the 
first and third by reason of the long n . 42 

(9) Distinctions by Duration in combination with a 
Type of Stress 

556. It is probable that differentiations by this combination 
are only found in conjunction with differences of tone. The 
instances quoted from Serbo-Croat in §§560, 583 appear to involve 
differences in types of stress. They are cases in which it is difficult 
to say whether any one of the attributes predominates over the 
others. I am disposed to consider the length and stress as subsidiary 
to the tone. It is, however, worthy of note that Fry and Kostic 
have evolved an adequate phonetic notation for Serbo-Croat on 
the basis of duration and stress. (See §466.) 

(10) Distinctions by Duration combined with 
Voice-Pitch 

557. Differentiations by this combination are found in Pekingese 
and doubtless also in other monosyllabic tone languages of the 
Far East. Words having the third Pekingese tone (the low-rising) 
are said with longer syllabic elements than words of similar sound 
but other tones. Thus, for instance, / mo (smudge) has a noticeably 
longer vowel than "mo (touch) or / mo (ghost) or N mo (ink), and 
the a of ^ma (horse) is longer than the a’s of _ ma (nurse), / ma 
(hemp) and v ma (abuse). This length is clearly subsidiary to the 

42 Weakly stressed syllables in which the vowel is followed by a long 
consonant or by one of certain consonant groups are naturally longer than 
other weakly stressed syllables. Such are the first syllables of kam'mino 
(I walk), ver'ranno (they will come), al'tsare (to raise). 

It is possible that the first syllables of such words as setti'mana (week), 
suppli'mento (addition), lunga'mente (for a long time) might be found 
by mechanical measurements to be nearly or even quite as long as the first 
syllable of 'SEtte (seven), 'tsuppa (soup), 'lungo (long), but it must be observed 
that they are long not only on account of the consonants following the vowel 
but also because they bear a fairly strong degree of stress ("secondary stress”). 

Even final syllables with strong stress may be considered to be long in Italian, 
despite the fact that the vowels of such syllables are actually short. That 
they are felt as long is shown by the fact that when a word beginning with a 
consonant follows, that consonant is lengthened: thus the second syllable of 
fa'ra (he will do) is potentially long; it is actually long in fa'ra b'bsne (he 
will do well). 
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tone, and does not need to be marked in broad transcription. 
(Under some conditions the third tone is also accompanied by special 
tambers. See §§112, 113, 242, 257.) 

558. Serbo-Croat is another language in which words are 
differentiated by a combination of duration and voice-pitch. 
(Type of stress is, however, also involved, as already mentioned in 
§556.) The nature of the lengths, stresses and tones of this language 
have been concisely set out by D. B. Fry and D. Kosnd in the 
introduction to their Serbo-Croat Phonetic Reader, and the following 
particulars are taken from that work and from other information 
given me by Professor Fry. 

559. The Serbo-Croat language contains short, fairly long and 
very long syllabic sounds (i.e. vowels and syllabic r). The fairly 
long and the very long syllabic sounds are distinguished from the 
short syllabic sounds, and from each other, not only by their 
duration but also by special tones. The fairly long sounds have a 
rising tone (here represented by '), while the very long sounds 
are usually 43 said with a rising-falling tone (here represented by 
The short syllabic sounds have no definite tones, but syllables 
containing them may have significant strong stress 44 or significant 
medium stress, 45 or of course they may have stress which is weak 
by comparison with that of neighbouring syllables. Syllables 
bearing the strong stress generally, though not necessarily, have a 
prominent voice-pitch (e.g. a high pitch if the neighbouring syllables 
are low); syllables bearing the medium stress have much the same 
pitch as neighbouring weakly stressed syllables. 

560. The usage is illustrated by the following Serbo-Croat 
words: 

(a) very long syllabic sound with rising-falling tone contrasted 
with strongly stressed short sound: 

‘druga (next, f.), dru:ga (friend), 


13 When words containing very long syllabic sounds occur in unimportant 
positions, they are often said without their characteristic rising-falling tone: 
they become "toneless.” When this happens, the distinction between the 
long sound and a similar short sound with medium stress (§461) would appear 
to be one of duration only. 

44 The strong stress is of the diminuendo type. 

46 The medium stress appears to be of the level or slightly crescendo type. 




182 THE PHONEME: ITS NATURE AND USE 

■kpiga (book, nom. sing.), kpuga (book, gen. pi.), 

Meta (summer, gen.), lirta (flight), 

'toj (this, f. dat.), to:j (this, loc. sing.); 

(b) very long syllabic sound with rising-falling tone contrasted 
with medium-stressed short sound: 

,puna (full, m. gen. sing.), puma (full, f. nom. sing.), 
iprimorju (coast, dat.), prkmorju (coast, loc.), 
izrna (grain, gen. sing.), zP:na (grain, acc. pi.), 
ipix (they, f. dat.), jii:x (they, m. gen.); 

(c) very long syllabic sound with rising-falling tone contrasted 
with fairly long sound with rising tone: 

mo’ra (seas, gen.), mo:ra (must); 

(d) fairly long syllabic sound with rising tone contrasted with 
medium-stressed short syllabic sound: 

sam (to be), sa'm (himself). 

561. Differentiations by combinations of duration and voice- 
pitch are also characteristic of Rhineland German. In that type 
of German there are two types of long vowel and diphthong: the 
moderately long (“unterlang” in the terminology of Professor 
Menzerath) and the fully long ("lang” in Menzerath’s termin¬ 
ology). These are accompanied by special intonations which may 
be termed “high” and “low” respectively. 4 * 


4 * Special combinations of duration and voice-pitch are doubtless to be found 
in some other types of German, and I have in fact been told that something 
of the kind occurs in the German of Munich. The late Professor Paul 
Menzerath of Bonn appeared to hold the view that these or analogous dis¬ 
tinctions are made in most parts of Germany. I am not in a position to 
judge of the extent of the areas over which this type of pronunciation is found. 
But I find it difficult to believe that it is as widespread as Menzerath 
suggested. The experienced phoneticians Vietor, Siebs and E. A. Meyer 
made no mention of it, and they did not indicate it in their transcripts. 
(See Vietor, Deutsches Aussprachewdrterbuch\ Th. Siebs, Deutsche Hoch- 
sprache ; Ernst A. Meyer, Deutsche Gesprdche.) The distinctions appear to 
have been unknown also to H. Klinghardt, the authority on intonation, 
and there is no indication or reference to them in his Obungen in deutschem 
Tonfall. The distinctions also found no place in the sections dealing with 
vowel-length in Sievers’ Grundziige der Phonetik, though he spoke of "long” 
and "over-long” vowels in other connexions (§696 of the fifth edition) and 
stated that the true difference between long and short vowels is that the 
long ones are "prolongable" (“dehnbar”) while the short ones are not. 
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562. These terms have specialized meanings here, as the values 
of the tones vary with the sentence intonation. The two chief 
forms of each are as follows: 

"high” denotes high-falling [\] when the word as a whole 
is said with falling intonation; it denotes mid or low rising 
to high, [ y ] or [/], when the sentence requires that the 
word as a whole should be said with rising intonation; 

"low” denotes mid descending to low [ s ] when the word as a 
whole is said with a falling intonation; it denotes mid or 
low level with or without a slight rise, i.e. intonations of 
the types [_] and [/], when the sentence requires that the 
word as a whole should be said with rising intonation. 

563. In the following paragraphs the vowels of moderate 
length are marked with ■, and those of full length with:. Diphthongs 
with the moderate length are written without mark, and those with 
full length are written with : after the first letter. 

564. The principles involved may be illustrated by the words 
mem (mehr, more), me:r ( Meer , sea), 'Jnaider ( Schneider , instrument 
for cutting), 'Jnaiider ( Schneider , tailor). The common intonations 
of mcr and me:r are: (1) with falling intonation [\] and [ 

■s] 

respectively, (2) with rising intonation [ /] and [/] respectively. 
The intonations of the words Schneider are illustrated by the following 
sentences: 

'hi’r ist der 'Jnaider (instrument) [ \- \.] 

'hi’r ist dsr ‘Jnaiider (tailor) [\-- S- ] 


He made no mention of “ unprolongable ” long vowels. Curiously enough, 
however, he gives in the section on consonant -length ( Grundziige , p. 260) one 
example of a pair of words distinguished by the length of a diphthong: he 
indicated braut (brews) as having a longer U than Braut (bride), but it is not 
clear whether he is there referring to dialect speech. 

I have made enquiry of several Germans from parts other than the 
Rhineland, who have informed me that they have never heard of any such 
distinctions. 

It may be added that, according to Menzerath, the fully long vowels 
and diphthongs are capable of prolongation beyond their normal length. 
They are, in his terminology, “dehnbar.” The vowels and diphthongs of 
moderate length cannot receive extra prolongation; they are not “dehnbar.” 

He has stated that "prolongability" (“dehnbarkeit") is an infallible 
criterion for determining which type of length a vowel or diphthong has 
(see his article on the subject in Le Maitre Phonetique, July, 1934, pp. 09, 70). 
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dtr 'Jnaider ist 'da' \] 

dsr ’Jnaiider ist 'da 1 [•-" \] 

565. Other examples of words which in Rhineland German 


exhibit the above combinations 
moderate length 
with “high” tone 
fid {fid) 
rain {Rhein) 
va'r ( wahr) 

'bauer {Bauer, cage, one who 
builds) 

'tsaden {Zahlen, plur. of Zahl) 
'Jomer ( schbner, adj. masc., 
after ein for instance) 
hi*r (hier) 
i>n {ihn) 

'Jtyle {Stiihle) 

Jtud {SUM) 


of length and voice-pitch are: 
full length 
with “low” tone 
fi:l (viel) 
raiin (rein) 
va:r (war) 

'baiuer (Bauer, peasant) 

'tsailen (zahlen, v.) 

'Joiner (schoner, comparative) 

heir (her) 
tsiil (Ziel) 

'yiber {fiber) 
be'feil (Befehl) 


(11) Distinctions by a Degree of Stress combined 
with a Degree of Stress on an Adjacent Syllable 

566. Differentiations merely by a degree of stress on a single 
syllable are very uncommon. They occur, however, in Serbo- 
Croat, as already mentioned (§461). Occasional examples are also 
to be found in English, where a word with two strong stresses 
may be semantically distinct from another word having the same 
sounds but only one strong stress. I have not yet found any 
convincing examples of this in ordinary Southern English, though 
I think it likely that a few exist. Instances may, however, be 
observed in the speech of those who pronounce the weakly stressed 
prefix re- as rii-; with them remark (observe) and re-mark 
(mark again) are distinguished as rii'maik and 'rii'maikT (There 
is probably a certain difference of duration accompanying this 
distinction by stress.) 

567. On the other hand differentiations by relative stress on 
two neighbouring syllables are fairly frequent. This is the common 
case usually known as differentiation “by the position of the stress,” 
as in the pairs of English words 'bilou (billow), bi'lou (below), 
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'inkrirs ( increase , n.), in'kriis ( increase , v.), 'import ( import , n.), 
im'port ( import , v.). (See §§453 ff.) 

568. The essential difference between such pairs is not merely 
that a particular syllable is “stressed” (to use the convenient but 
inexact way of saying "pronounced with strong stress”), but that 
the neighbouring syllable(s) should be at the same time "un¬ 
stressed” (the common term for "pronounced with weak stress”). 
Stress as commonly understood is thus in reality a combination 
of two attributes: a strong stress on one syllable and a weak stress 
on a neighbouring one. When therefore we say that in a stress 
language two words are distinguished “by the position of the stress,” 
we really mean that they are distinguished by the relative strengths 
of the stresses on two syllables. 

569. The subject of distinctions by strong and weak stress on 
neighbouring syllables has already been discussed at length in 
the chapter on stress (Chap. XXIV). 


(12) Differentiations by Degrees of Stress combined 
with Voice-Pitch 


570. It was pointed out in §§455, 456 that when words are 
differentiated by stress alone, the distinctions are by no means 
clear. In order that distinctions by degrees of stress should be 
clearly perceived by the hearer it is almost essential that the stresses 
should be accompanied by differences in some other attribute. 
Accordingly it is nearly always found in stress languages that strong 
stresses are accompanied by prominent intonations. 


571. Thus, as already indicated in §457, if English words such 
as 'bilou ( billow) and bi'lou (below) or 'import (n.) and im'port (v.) 
are said in isolation with a falling intonation, they have tonal 
forms which may be represented graphically thus: 


[• \] 


'bilou bi'lou 

'import J L im'pori 

And if they are said in isolation with a rising intonation, their 
tonal forms are of the following types: 


-} 


* •' 


bi'lou 

n* 


/] 


'bilou 

'import J - J im'pori 

In fact, in nearly every case, such differentiations in English are 
effected by stress coupled with intonation. (See further, §§457-460.) 
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572. Intonations such as those shown in §571 cannot be regarded 
as “significant word tones” in the generally accepted sense of the 
term. They are special cases of sentence intonation or portions 
of sentence intonations, and such words can be and are said with 
a variety of other sentence intonations. Some of these do not 
help to render the stressed syllables prominent (see §436). 

573. Hence, we find ourselves obliged to regard the stress as 
constituting the essential difference between pairs of English words 
like the above, in spite of the fact that the stresses are generally 
linked with special intonations and that when they are not so 
linked the distinctions are difficult to perceive. 

574. Similar considerations apply to other stress languages, 
such as German, Italian, Spanish, Russian and Greek. The 
relations between stress and intonation in English, German and 
Russian have been investigated in considerable detail, 47 and the 
researches have made it clear that in these languages the stress is 
from the point of view of word-meanings the essential attribute 
and that intonation is subsidiary to it. 48 Similar conditions no 
doubt also prevail in the other stress languages. 

575. In a few languages, of which Norwegian and Swedish are 
the best known, use is made of significant tones which are linked 
to degrees of stress. The tones in these languages take the form of 
tone patterns which may be spread over two or more syllables. 
The salient points or "nuclei” of the tones are always strongly 
stressed, but it seems clear that the stress is subsidiary to the tone. 
If Norwegian or Swedish sentences were said with correct intonation 
but with equal stress on all the syllables, it would I think be found 
that native hearers would perceive the general effect to be sub¬ 
stantially correct. But correct stressings with marked alterations 
in the intonation would, I believe, sound incorrect and might in 
some cases suggest wrong word-meanings. In exemplification of 
this type of tonal usage we give in the following paragraphs a 

47 See R. Kingdon, The Groundwork of English Intonation', D. Jones, 
Outline of English Phonetics; Armstrong and Ward, Handbook of English 
Intonation', H. Klinghardt, Vbungen in deutschem Tonfall', M. L. Barker, 
Handbook of German Intonation', S. Boyanus and N. B. Jopson, Spoken 
Russian. 

41 We are leaving out of account here the important functions of intonation 
as a modifier of sentence meanings. 
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description of the main features of one of the Norwegian tone 
systems, namely, a common South-Western pronunciation. 

576. This form of Norwegian contains two tonal patterns; 
they have been called the "simple tone” and the "compound 
tone” by some writers. The simple tone is used, for instance, in 
the Norwegian words 'boennar (peasants) and the compound tone 
in v bcennar (beans, prayers). The nucleus of the Norwegian 
“simple” tone consists of a low or low-falling pitch (graphically 
[_] or [ v ]) combined with a strong stress. The nucleus of the 
Norwegian "compound” tone consists of a high-falling pitch 
(graphically [ \]) combined with a strong stress. These tones, 
besides having the salient features just mentioned, require that 
preceding and following weakly stressed syllables should have certain 
pitches relative to them. A weakly stressed syllable preceding a 
strongly stressed syllable with “simple” tone has a higher pitch 
than that tone, while a weakly stressed syllable preceding a strongly 
stressed syllable with “compound” tone has a lower pitch than the 
beginning of that tone. The treatment of weakly stressed syllables 
following a strongly stressed syllable depends upon whether they 
terminate a statement or not. If they terminate a statement, they 
are all said on low level pitch; if they do not, they are said on an 
ascending sequence. 49 

577. Statements consisting of two clauses therefore present 
tonal patterns like the following: 

(a) "simple” tone in both clauses [•••'' x .. ] 

(b) "simple” tone in 1st clause and "compound” tone in 2nd 
clause [•..•■ \ . . ] 

(c) “compound” tone in both clauses [ • \ . • • \ .. ] 

(d) "compound” tone in 1st clause and “simple” tone in 2nd 
clause [ ■ \ . •' x .. ] 

578. Each of the tonemes in the type of Norwegian here referred 
to comprises therefore two very markedly different tones, namely, 


*• It would appear, however, that in the pronunciation of many Norwegians 
no difference is made between the tones of unstressed final syllables and 
unstressed non-final syllables; the pitch rises in both cases. In this pronuncia¬ 
tion all clauses, whether final or not, have tones of the types [•_■'](" simple ”) 
or [ • \ - ■ ] ("compound”). 
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the tone used finally in a statement and that used in other situations. 
Each of these in turn comprises sub-varieties depending upon the 
pitch of the syllables preceding and following in the sentence. In 
particular, when several weakly stressed syllables precede a com¬ 
pound tone, the salient part of that tone tends to begin higher 
than when only a single weakly stressed syllable precedes it. 

579. Special sentence intonations can of course be superimposed 
upon what may be called the normal intonation of Norwegian 
sentences. These give rise to still other variants of the tones. 
It is, however, not possible to count such further variants as members 
of the tonemes described above. It appears necessary to consider 
emphatic speech, or any special kind of speech requiring a deviation 
from the “normal” intonation, as another form of the language 
having its own tonemes. 

(13) Differentiations by particular Kinds of Stress 

COMBINED WITH VoiCE-PlTCH 

580. The researches of Miss L. E. Armstrong into the phonetics 
of Somali show that that language is one in which differentiations 
are made by complexes of voice-pitch and types of stress. 50 There 


50 See L. E. Armstrong, The Phonetic Structure of Somali (in Mitteilungen 
des Seminars fur Orientalische Sprachen zu Berlin, XXXVII, Part 3, 1934). 

Prior to Miss Armstrong’s work it was thought that Somali was an ordinary 
stress language, i.e. one in which words were distinguished by the location 
of strong stresses. Previous observers had perhaps not been on the look¬ 
out for tones. Meinhof, for instance, used stress-marks in his transcripts 
of Somali words, but he made no mention of word-tones beyond expressing 
the opinion that there were not likely to be any (see Die Sprache der Hamiten, 
pp. 168, 169). Even Dr. M. von Tiling, who has an excellent ear, made 
no mention of tones in her remarkable works Die Sprache der Jabdrti mit 
besonderer BeriXchsichtigung der Verwandtschaft von Jabdrti und Somali (in 
Zeitschrift fiir Eingeborenen Sprachen, XII, 1, 1922), and Somali-Texte, though 
I once gathered from a conversation with her that certain of her observations 
had made her suspect the existence of tones. 

When Miss Armstrong started work on Somali in 1931, she discovered 
that many words were distinguished from each other by what appeared 
to be tone only. I heard these tones from one of her native collaborators, 
and can testify to the accuracy of her observations in this matter. It 
subsequently proved that, at any rate in some cases, words were differentiated 
not by tones only but by tones combined with types of stress. I heard 
examples of this too from the same native speaker. 
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are in Somali, as far as is at present known, 51 two types of strong 
stress, an even or slightly crescendo type and a diminuendo type. 
The first of these is accompanied by a level or slightly rising tone, 
while the second is accompanied by a high-falling tone. The 
even stress is sometimes of high pitch (marked with ' in the 
Armstrong notation). The falling tone is denoted by ' in the 
Armstrong notation. 

581. It would seem that every syllable containing a long vowel 
has one of these stresses with its appropriate tone. The stresses 
(with appropriate tones) may also apparently be spread over 
two or more syllables containing short vowels. Whether they 
may be spread over sequences containing both long and short 
vowels or more than one long vowel has not yet been ascertained. 

582. The following are examples of the two types of stress in 
Somali. The tones shown are those used when the words are said 
in isolation. 

(1) Level or slightly rising tone with even or crescendo stress: 

{a) on single syllables: ge:s [-] (side), sum [-] (road), 
dim [-] (religion), ce:g [-] (beacon), qam [-] (debt) 52 ; 

(i b ) spread over two syllables with short vowels: inan [ • • ] 
(daughter), bahal [ • • ] (thing), galab [ • • ] (evening), 
soddoh [••] (mother-in-law); 

(c) spread over three syllables: arba'ia [ • • • ] (Wednesday); 

(d) in words of more than one syllable containing one 

long vowel, the pitch of that vowel being high: Saiwa 
[".] (to-night), [".] (wooden dish), qadin [".] 

(young he-camel), ku:Ja f “. ] (the necklace), la:bta [". ] 
(the chest), ge:sta f.] (sides); 

(<?) in words of more than one syllable containing one 
long vowel, the pitch of that vowel being mid: qa:lin 
[-•] (young she-camel), kudo [-•] (necklaces), ga:cj_i 
[-„•] (carriage), feicjo [-■] (ribs), saga:! [■-] (nine), 
qacjom [•-] (bark of tree), xami:s [■■] (Thursday), 
Jabed [■-] (female leopard); 


61 I say, “as far as is at present known,” since some of the differences may 
prove on further investigation to be differences of tone only. 

M These words are often pronounced ge:s [/], sum etc., with high 
and slightly rising tone, when another word follows in the sentence. 
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(/) in words containing two long vowels, see (3). 

(2) Falling tone with diminuendo stress: 

(а) on single syllables: g&s [ \] (horn), su:n [ \] (strap), 
dhn [ \ ] (tortoise), doe:g [ \] (green grass) 83 ; 

(б) spread over two syllables with short vowels: inan [ •. ] 
(son), bahal [ •. ] (wild beast), badag [ •. ] (goose), tic^i 
[ •. ] (she said), waddo [ •. ] (path), sanka [ •. ] (the 
nose); 

(c) in words of more than one syllable containing one 
long vowel: doeigga [ \ . ] (the green grass), tktia [ \ . ] 
(the shower), gd:ska [\] (the horn), ita:l [-\] 
(strength), haj?6:n [• \ ] (night), xamks [• \ ] (long 
shirt), Jab e: l [• \ ] (male leopard); 

(d) in words containing two long vowels, see (3). 

(3) The case of words containing two long vowels: 

(a) When both tones are level and of mid pitch, the stresses 
appear to have about equal force; if there is any difference 
it is that the second has slightly stronger stress than the 
first. 

Example: < ie:sa:n [--] (female kid). 

(b) When both tones are level and one is of high pitch, 
the high-pitched one appears to have the stronger stress. 

Example: na:gta: [" -] (that woman). 

(c) When one tone is falling and the other level, the falling 
tone appears to have the stronger stress. 

Examples: SEisain [- \] (male kid), gkska: [ \_] 
(that horn). 84 


“ Words of this type are apparently always said with approximately the 
same high-falling tone when other words follow in the sentence. 

54 Further investigation of the stress and tone system of Somali is needed. 
It would not surprise me if it were eventually found that the language is 
not a stress language in the ordinary sense of the term, but that it is essentially 
a tone language—one in which special stresses or stress effects are incidental 
to the tones. The effects which have hitherto been regarded as strong stresses 
may prove to be not strong stresses at all but prominences (§§434 ff.), occasioned 
by particular sequences of tone coupled in some cases with length. 

It is at least clear from examples such as those quoted in §582, (1) (a) 
and (2) (a), that the mid-level tone and the falling tone distinguish words. 
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583. Differentiation by complexes of voice-pitch and types of 
stress is also found in Serbo-Croat, but in that language the 
distinctions are helped out by a certain difference in the duration 
of the vowel. The examples moTa (of the seas) and morra (must) 
quoted in §560 (c) illustrate this. The first syllable of mo-ra is 
believed to have crescendo stress; it also has a rising tone. The 
second syllable of mo-ra, when said in isolation has a mid pitch. 
The first syllable of mo:ra has crescendo-diminuendo stress coupled 
with a rising-falling tone, and the second syllable of the word 
pronounced normally in isolation has low pitch. (Length is also 
involved, the o of moira being longer than that of mo'ra, as 
explained in §559.) 


(14) Differentiation by Voice-Pitch on one Syllable 
COMBINED WITH VoiCE-PlTCH ON A NEIGHBOURING SYLLABLE 

or Syllables 

584. This is the same as the case of "tones spread over more 
than one syllable” (§575). It will be observed, for instance, that 
in Norwegian the terminal pitch-sequence [ \. ] by itself is 
ambiguous if the fall on the penultimate is a moderate one. It 
may stand for either the “simple tone” or the "compound tone.” 
When, however, another syllable precedes, there is no longer any 
ambiguity. If the pitch of the preceding syllable is higher than 
that of the penultimate, i.e. if the pitch-sequence is one that would 
be represented graphically by [' x. ]. the tone is the “simple” 
one. If, on the other hand, the pitch of the preceding syllable is 


And the examples in (1) (d) and (2) (c) make it practically certain that the 
high-level tone and the falling tone have significant function. I have heard 
words of these types pronounced by a native; the difference of tone is striking 
to anyone who is on the look-out for it; it is always consistently made. The 
difference in tone between such words as g£:sta and gerska was one of Miss 
Armstrong’s most remarkable discoveries. 

Questions which remain to be solved are (1) whether the distinction between 
the high-level tone and mid-level tones illustrated in (1) ( d ) and (I) (e) are 
used for effecting significant distinctions, (2) whether falling tones as well 
as high-level tones are used on syllables which are long on account of the 
presence of a closing consonant, such as the first syllables of sanka (the 
nose), waddo (path), and (3) whether such tonal forms as [ ” . ], [ \ • ], [ _ ■ ], 
[ • ’ . ] are to be regarded as sequences of tones or as single tones spread 
over more than one syllable. 
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lower than that of the beginning of the penultimate, i.e. if the 
pitch-sequence is of the type [. x . ]. then the tone is the “compound” 
one. 


(15) Differentiations by Voice-Pitch combined with 
Voice-Quality 

585. There exist tone languages in which certain tones are 
accompanied by a special kind of voice-quality, i.e. “breathy” 
voice or “creaky” voice. Such are Burmese and Vietnamese. 

586. Vowels bearing the low-level or falling tones in Burmese 
are always said with breathy voice-quality, while vowels with the 
high-level tone are said with harsh or “creaky" voice-quality. 
Examples of words said with breathy voice are _hle (boat), s pp 
(to speak); an example of a syllable requiring creaky voice is "’pja 
(to show). The usage is explained, with further examples, in the 
Burmese Phonetic Reader by Armstrong and Pe Maung Tin, 
pp. 21-25. 

587. The tones of Vietnamese have been examined from the point 
of view of accompanying voice-quality by Prof. E. Henderson, 
and a short account of them was given by her in Le Maitre Phonetique, 
January, 1943, p. 7. She finds that among other tones there are 
two which may be described as “rising-breathy” and "rising- 
creaky,” and that these differ somewhat in pitch as well as in 
voice-quality, though the difference in voice-quality is the pre¬ 
dominating one. In her words: “The pitch-pattern of the ‘rising- 
breathy’ tone is a short fall from middle to low pitch, a pause on 
low pitch followed by a sharp rise to high pitch. . . . This tone 
is fairly long and is usually accompanied by ‘breathy’ voice. The 
‘rising-creaky’ tone, which is always pronounced with strong 
‘creak,’ consists of the low-creaky tone followed by a quick rise 
or jump to a fairly high pitch.” There are also differences in pitch 
between the “low-breathy” and “low-creaky” tones of the same 
language, the “low-breathy" being accompanied by a slight fall. 
Prof. Henderson has suggested representing these combinations of 
tone and voice-quality by the following signs: u rising-breathy, 
v rising-creaky, s low-breathy, _ low-creaky. She has furnished me 
with a number of Vietnamese words exemplifying them; among them 
are u zai (belt), v zai (dribble), N zai (long), _zai (stupid), ”lap (clear, 
bell-like sound), v laj» (satin), s laji (good), Jaji (frozen). 



DIAPHONES 


193 


CHAPTER XXVII 

DIAPHONES 

Individual Differences of Speech 

588. When one compares the speech of one person with that of 
another speaker of the same language, differences of pronunciation 
are generally to be found. The differences are of various kinds. 

589. In the first place everyone has his own particular quality 
of voice. This sometimes differs considerably from that of others 
and sometimes hardly at all. 

590. Differences of voice-quality are as a rule ignored in phonetic 
work. We regard “speech-sounds” as being so to speak super¬ 
imposed on the speaker’s voice-quality. Although voice-quality 
forms an integral part of every voiced sound uttered in speech, 
we can in imagination subtract that part of the sound-quality 
which is personal to the speaker and examine the remainder of the 
quality as if it had a separate existence. We ignore personal 
voice-quality in the same sort of way as we ignore the scraping 
sound made by the reproducing needle of a gramophone. When 
therefore we judge a certain sound uttered by one speaker to be 
“the same” as one uttered by another speaker, we really mean 
"as nearly alike as their different voice-qualities allow.” 

691. A second type of difference between the speech of one 
person and that of another consists in different distributions of 
phonemes in particular words or sentences. Thus in the South 
of England some people pronounce again as a'gein, while others 
sound it a'gen. Likewise there are two current pronunciations of 
yesterday (‘jestadi and 'jestadei), accomplish (a'komplij and 
a'kAmpliJ), direct (di'rekt and dai'rekt), girl (ga:l and geal) and 
many other words. 

592. Speakers may differ too in the lengths of sounds they use, 
and the differences of length may or may not be accompanied by 
differences of quality. With some English people, for instance, 
the vowel sounds in jam, bad and grand are short; others use the 
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same quality but lengthen the vowel. 1 Similarly in Parisian 
French one hears both bag and ba:g (for bague), tabl and ta:bl 
(for table). In these cases there are differences of length without 
differences of quality. 

593. Again in the South of England we find most people pro¬ 
nouncing off and lost as of, lost, but some pronouncing o:f, !o:st. 
Here the difference is one of length combined with difference of 
quality. A similar case is that of room which is pronounced rum 
by some and ru:m by others. 

594. In stress languages individual speakers sometimes show 
variations in the manner of stressing words. Thus in Southern 
English one hears exquisite pronounced as 'ekskwizit and eks'kwizit, 
formidable as 'foimidabl and foi'midabl, dirigible as 'diridjabl and 
di'rid3abl, contents (n.) as 'kontents and kan'tents, adult as 'adAlt 
and a'dAlt. In the North we find magazine pronounced 'magazun, 
criticize pronounced kriti'saiz, yesterday pronounced jests'dei, where 
the South has maga'ziin, 1 * kritisaiz, 'jestadi or 'jestadei. 

595. Differences in intonation are also common. People’s pitch 
limits vary, for instance. Moreover different speakers often use 
different forms of sentence-intonation, some of which may be 
individual or local varieties of well-known intonation patterns. 
They may differ too to some extent in the choice of intonation 
pattern used in particular contexts. 8 There are also special 
intonation patterns characteristic of the speech of different parts 

1 Lengthening of this vowel is very common in London English (see §130 
and footnote 8 to §513). Those who lengthen it have one phoneme more 
than those who always keep it short, since their a-sound in these words 
cannot be regarded as the short of the a-sound they use in calm, large, etc. 
A broad transcription of their pronunciation requires the introduction of an 
additional letter. Either the letter a must be used in calm, large, etc. (thus 
ka:m, Ia:d3, etc.), or a special letter (e or as) must be used in jam, grand, 
back, etc. (thus djEim, gremd, etc., or djaeim, graeind, etc.). Those who 
have in addition to the above two long sounds another long a:, distinct 
from these, replacing the more normal ai3 (see §606, also footnote 11 to §249, 
and my Outline of English Phonetics, §414) have two phonemes more than 
those with the pronunciation I have been recording throughout this book. 
To represent their pronunciation broadly necessitates the use of two letters 
besides a, viz. £ (or ae) and a. Had (strongly stressed), hired and hard would 
need a transcription such as held (or haeid), ha:d, ha:d. 

2 Notice, for instance, how different B.B.C. news readers say "That is the 

end of the news ”, 
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of the country—patterns which it would be difficult to explain 
as variants of any of the well-known Southern English typical 
patterns. 

596. Lastly, the actual sounds used by one person are often 
found to differ from the corresponding sounds used by other speakers 
of the same language. Thus if one asks a number of Englishmen 
to say the word get, one usually finds quite a number of different 
kinds of e among them. Such differences are often said to con¬ 
stitute “free variations.” They must be carefully distinguished 
from “allophones” (members of phonemes). 

597. It is with such “free variations” that the present chapter 
is concerned. 

The Nature of Diaphones 

598. It is convenient to have a name for a family of sounds 
consisting of the sound used by one speaker in a particular set of 
words (said in isolation) together with corresponding though 
different sounds used in them by other speakers of the same language. 
Such a family may be termed a “diaphone.” 

599. The following are some examples of diaphones containing 
several sounds. 



Fig. 10. Diaphonic Area of the English Vowel in get. 

600. A typical instance is that of English “short e” referred 
to in §596. The various shades of e which may be heard from 
different English speakers in such words as get, fetch, egg are roughly 
as shown in the annexed diagram (Fig. 10). It is probable that 
any sound included in the shaded area of this diagram will be found 
to be some English person’s “short e.” 
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601. Another instance, which is noteworthy on account of the 
unusually large number of sounds included in the diaphone, is 
the case of the vowel sounds used by different English speakers 
in such words as home, go. These include a monophthongal o: 
(Scotland and Northern England), a diphthong of the ou-type 
beginning with a sound near to cardinal o (Northern, Western), 
and (in the South) several other kinds of diphthong beginning 
with opener kinds of o and with lesser degrees of lip-rounding, 
extending as far as ou. In the South too there are yet other types 
of diphthong in common use in these words. Some of them start 
with spread or nearly spread lips, and they include sounds of the 
types au , 3 au, au (London), ou (“clerical” accent). All the above 
sounds can be grouped together as “free variants” or members of 
a single diaphone. 

602. An example of a consonantal diaphone is seen in the 
English J. Its quality varies noticeably from speaker to speaker, 
the variations depending partly upon the position taken up by 
the tongue, partly upon the point at which the articulation is made 
and partly upon the position of the lips (which with some are pro¬ 
truded after the manner shown in Figure 101 of my Outline of 
English Phonetics and with others are spread). 

603. Examples from foreign languages of diaphones of fairly 
wide extent are the German long a:, the Spanish sound of ll and 
the Hindi sound ai. 4 The German a: varies from about cardinal 5 
(Hamburg, Liibeck, etc.) to about cardinal 4 (Berlin). The Spanish 
diaphone written ll comprises three very distinct members, X, j and 
(especially in South America) 3 . The sound of Hindi ai varies from 
a well marked diphthong which could be represented narrowly 
by At (Lucknow) to a monophthongal or almost monophthongal e: 
(Lahore). 

604. Diaphones may comprise subsidiary members of phonemes 
without the corresponding principal members. For instance, the 
kind of t I use in position of weakest stress, as in 'beta {better ), 
■geti q {getting), is a weakly articulated sound without much 
aspiration—nearly d in fact. But there are other speakers who 
use ? in this position and pronounce 'be?a, 'gePin, though their 


3 My OU is rather like this. 

4 Formerly written ai in romanic systems. 
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t’s in other positions are similar to mine: the diaphonic variants 
are thus confined to subsidiary members of the t-phoneme. 5 

605. Style of pronunciation has to be considered in connexion 
with the idea of the diaphone. As is well known, most people use 
more than one style of pronunciation. (Mention has been made of 
this in §28.) The style depends upon the circumstances under which 
they are speaking. There is what we may call the "ordinary” or 
“slow conversational" style,® there are very rapid familiar styles, and 
there is formal style used for instance in reciting or reading aloud 
to a large audience, and there are styles intermediate between 
these. With some speakers there is little difference between the 
styles beyond the use of a certain number of abbreviations and 
assimilations in rapid familiar style which would not be made in 
slower styles. But with others the differences are considerable 
and may involve the use in formal and in very rapid styles of sounds 
not occurring at all in the “ordinary” style. When such additional 
sounds are introduced, they generally constitute special phonemes. 
They are also included in diaphones with the sounds which would 
replace them in “ordinary” style. 

606. The case is exemplified by pronunciations of words like 
fire, tired. There are English people who in deliberate speech say 
such words with a disyllabic vowel sequence of the type aia, but 
who in rapid conversational speech substitute a single diphthongal 
sound aa or a particular variety of monophthongal a:—sounds 
which are non-existent in their deliberate style. These sounds 
constitute separate phonemes, though they belong to the same 
diaphone as the aia of their deliberate style of speech. 7 

607. Another example of the use of special sounds in particular 
styles of speech may be observed in the treatment of the English 
words which have weak forms containing a as well as strong forms 
with other vowels Thus one occasionally hears such words as 

s I am assuming that the ? of these speakers is a member of their t-phoneme. 
That it is so is probable, though this has not yet been proved to my knowledge. 

•What P. Passy called "prononciation familifere ralentie.” 

7 With those who reduce aia to a: this sound is a phoneme separate from 
the common long ai (of far, tarred, etc.), and the two sounds have to be 
distinguished in broad transcription of the rapid style. This is best done 
by using the symbol a to denote the common long a (thus fa:, ta:d) and 
transcribing fire, tired, etc., with a: (fa:, ta:d). See also footnote 1 to §592. 
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and, have pronounced in formal speech with a vowel intermediate 
between the ordinary short a and a; this happens in situations 
where conversational speech would have an(d) or n(d) and (h)av. 
This intermediate sound is an "obscured” a, which may be written 
phonetically a. It is probably not to be regarded as a separate 
phoneme, since the style of speech in which it occurs is almost 
certainly erratic and therefore not reducible to phonemes (see 
Chap. XXVIII). The sound would, however, belong to the same 
diaphone as the ordinary short a. 

608. Variphones (§628) are presumably to be treated as particular 
cases of diaphones. 

609. Varieties of sound heard in erratic pronunciation (Chap. 
XXVIII) are sometimes groupable into diaphones. Those whose 
pronunciation is erratic generally speak with a mixture of dialect 
and if they sometimes use the sound of one manner of speech, 
and sometimes the corresponding sound of another manner of 
speech, the two sounds can be grouped together into a diaphone 
just as they would if the two manners of speaking were used con¬ 
sistently by two different speakers. (Groups of this nature may 
perhaps sometimes be considered as particular types of variphone. 
See §648.) 

Overlapping of Diaphones 

610. Overlapping of diaphones is of common occurrence, i.e. 
a particular sound may be the one used in one set of words by 
one speaker but in another set of words by another speaker. This 
is especially liable to happen, when a sound lies near the limit of 
a diaphonic "area.” For instance, the English e and a diaphones 
overlap after the manner shown in Figure 11. This means that 
a high variety of a ([a*] or [»]) may be the sound used by one 
speaker in words like bad, back, jam, but the same sound may be 
a diaphonic variant of e and may be heard from some speakers 
(chiefly Northern) in bed, beck, gem. 

611. The English ou and au diaphones likewise overlap. In 
Scottish English words like mouth, about are pronounced with au, 
while, as we have already seen (§601), au is one of the variants 
in the ou-diaphone. If a Londoner were to hear someone say 
a'baut, without knowing anything of the other sounds used by 
the speaker, he would naturally interpret it as a boat. But if 
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a Scotsman heard the same utterance, he would take it to mean 
about. 



Fig. 11. Overlapping of the English short e and a Diaphones. 

612. Similarly the sounds [haei] and [baeit] mean high and bite 
to some but hay and bait to others. Some people’s way of pro¬ 
nouncing pa'zijn ( position ) is barely distinguishable from other 
people’s pronunciation of pa'zejn ( possession ). The American way 
of saying possible is much the same as the Southern English 
pronunciation of passable. And so on. 

613. It will be seen then that sounds lying near to the limits 
of diaphonic "areas” are recognized as belonging to one diaphone 
or another not so much by their inherent quality as by their relation¬ 
ships to the speaker’s other sounds, and by the contexts in which 
the words are used. If, as is to be assumed, the speaker’s sentences 
are intelligible, the correspondences of the sounds he uses with 
those used by others are inferred from the general effect of his 
speech. Single words said by him in isolation may be ambiguous to 
people who use diaphonic variants remote from his, but they become 
intelligible as soon as they are said in sequence with other words. 

614. The following is an actual case which was once brought 
to my notice. A child was evacuated from London in 1940 and 
sent to live with a family in the country who use a pronunciation 
similar to mine. When asked his name by the children of the 
family, he gave it as “d 3 aimz.” The children took this to be a 
name they had never heard of before—a name that they would 
naturally have spelt Jimes if they had been asked to write it. It 
had to be explained to them that what the boy said was the London 
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way of pronouncing James. But when this London boy carried 
on a connected conversation, he was understood without any 
difficulty. No doubt all his other vowels were proportionately 
different from those who pronounce d 3 eimz, and the necessary 
allowances are instinctively made by hearers in such a case. 
Context evidently plays a very large part in ensuring intelligibility 
between people who use widely differing members of diaphones. 8 

Sounds belonging to Two Diaphones 

615. A sound may belong to two diaphones of the same language. 
One case is that just mentioned (§§610-614) where there is over¬ 
lapping of diaphones. Another is when a single sound of the speech 
of one person corresponds to two sounds of the speech of others. 

616. The pronunciations of or in English afford an illustration 
of the latter case. Without entering into details of all the variant 
pronunciations heard in words written with or, which are rather 
involved, we may exemplify the point under consideration by the 
pronunciation of those who use an open o: [d:] in all such words as 
form, order, port, force, board and of those who divide such words 
into two categories, viz. (1) woids such as form, order, north in 
which they use the open o: [d:] and (2) words such as port, force, 
board, course in which they use a diphthong oa beginning with a 
close o. 9 There is, in the words of the first category, a diaphone 

8 The extent to which the employment of unusual sounds can be carried 
without interfering with intelligibility is illustrated by a case of a peculiar 
type which once came under my observation. The speaker was a man whose 
vowels were mostly considerably lower than those normally heard. His 
(so-called) i: was practically ei, his short i resembled most people's short e, 
his short e was a kind of e, his (so-called) u: was on the o level, and so on. 
The relations of his vowels to each other were of the same order as those of 
normal speakers, but the distances between them were less than usual. His 
whole vowel system was in fact compressed into an area below the e-O line 
on the vowel diagram. Everyone noticed that his speech was peculiar. 
It sounded somewhat blurred, but as a rule there was no difficulty_in under¬ 
standing what he said. 

* For lists of the words in the two categories, see Ripman and Archer’s 
New Spelling, Appendix VII, or my Outline of English Phonetics, footnote 36 
to §308. 

Another diphthong 33 beginning with an open o (3) is also used by many 
English speakers in some of these words. We are leaving this out of consider¬ 
ation in the present discussion. 
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consisting of a monophthongal o: [a:] with slight quality variants. 
In the words of the second category there is a different diaphone 
which includes not only the members of the diaphone just mentioned 
but also diphthongal sounds of the oa-type. The Southern English 
monophthongal o: [a:] thus belongs to two diaphones. It should 
be noted that in the speech of those who use oa in words of the 
second category the sounds o: [a:] and oa belong to separate 
phonemes; they distinguish several pairs of words, e.g. 'bo:da 
{border), 'boada [boarder), bom {born), boan [borne), mom {morn), 
moan [mourn), ho:s {horse), hoas {hoarse). 

617. The case is further exemplified by words like food, boot, 
good, foot, except that the sounds involved are not different phonemes 
but are members of the same phoneme conditioned by length. 
Most English people distinguish two categories of oo-words, (1) food, 
boot and other words, which are said with long u:, and (2) good, 
foot and other words, which are said with a short and opener u [o]. 
Scottish people, however, do not as a rule make this distinction; 
they use a close short u in all these words. 10 We find therefore 
two diaphones. There is firstly what we may call for convenience 
the long u: diaphone which comprises u: and certain variants 
(uw, ou, etc.), together with the Scottish u and its variants. 
Secondly, there is what we may term the short u-diaphone, which 
comprises the English short open u [o] and also the Scottish u 
with its variants. Every variant of the Scottish u thus belongs 
to both of these diaphones. 

618. Yet another example of a sound belonging to more than one 
diaphone is the glottal stop (?) which in English belongs to three 
diaphones. It is used by many where more normal pronunciation 
has a variety of t (e.g. 'be?a for 'beta), by some where the normal 
pronunciation has k (e.g. 'bei?a for 'beika) and by a few where the 
normal pronunciation has p (e.g. 1 pi:?l for 'pirpl). ? thus belongs 
to the t, k and p diaphones in certain types of English. 

Diaphones and Phonemes 

619. It must be observed here that the users of particular 
diaphonic variants have their own phonemes, which differ from 
those of people using other diaphonic variants. Thus when we 

10 In Scotland the sound heard is generally of a central type [U], but the 
quality varies with individuals. 
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say that a certain English person’s short e is particularly close, 
we mean that his principal e (as in get, fetch, egg ) is a rather close 
sound, perhaps not much opener than cardinal No. 2; he will have 
also subsidiary members of the phoneme, the chief of which will be 
a moderately open e used in such a word as help (see §88). When, 
on the other hand, we say that a person’s short e is an open one, 
we mean that his principal e, as in get, is an open one—it may be 
as open as cardinal No. 3, or even opener—and that his subsidiary 
e in help is opener still. 

620. The use of particular diaphonic variants may involve 
unusual phonemic distributions. Thus if we assume that the ? 
of those who pronounce 'be?a, 1 bei?a, *pi:?l (§618) is not a separate 
phoneme, 11 this sound has to be assigned to the same phoneme 
as either t, k or p; t would presumably be the most appropriate. 12 
The words baker and people are thus said by these speakers with 
unusual pronunciation which would appear in a broad transcription 
as 'beita, 1 pi:tl. 

621. It is sometimes difficult to determine whether two sounds 
are merely diaphonic variants or whether they belong to different 
phonemes. The difficulty is illustrated by the a-sounds of Northern 
French, since there is considerable fluctuation in the values of the 
sounds and their distribution in words. The position is clear 
enough in the type of French recorded by Paul Passy. That 
variety of French comprises a front a, a back a and some shades 
of sound intermediate between these. The sounds are clearly 
assignable to two phonemes since the front a and the fully back a 
both occur long and short in similar phonetic contexts, as is shown 
by the following examples from Passy’s pronunciation: mwa (wot), 
mwa ( mois ), la [la, la), la (las), wat ( ouate ), drwat ( droite), tirwair 
(tiroir), krwa:r (croire). In the same type of French intermediate 
a-sounds occur in certain non-final syllables. Those which are 
definitely a-like are assignable to the a-phoneme, e.g. those in 

11 This has not yet been proved. To demonstrate it would necessitate 
an exhaustive investigation of the conditions under which ? and t-sounds 
occur in weakly stressed positions in the pronunciation of the speakers who 
use ?. (See further footnote 3 to §712.) 

la Although the ? belongs diaphonically to k and p as well as to t, it would 
be inadvisable to assign it phonemically to more than one of these for the 
reasons given in Chapter XX. 
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gate ( gdter ), kase (casser), krware ( croirais ), while the others, 
including “middle” a’s such as those in gars5 ( gargon ), artikl ( article) 
are assignable to the a-phoneme. 13 

622. I believe Passy’s speech to have, been a very usual type 
of Northern French, and, owing to its consistency and the definiteness 
of its phonemes, it is a form of French particularly well suited for 
practical teaching of the language. There are, however, other 
varieties of Northern French differing considerably from his in 
the matter of the a-sounds. I have known one Parisian of Parisian 
parentage who had no front a at all, and who always used a back a. 
On the other hand it appears that there are some who have no 
back a, and who use a front or middle a exclusively. Others 
again use more than one a-sound, but distribute the sounds in 
words in a manner different from that described by Passy. For 
instance, it is not uncommon to hear Northern French people 
using back a in every terminal position (e.g. in mot as well as in 
mois) and a front a in various other positions (in casser, for example). 
(See also §131.) 

623. The existence of these variations of pronunciation has as 
a result that a and a and intermediate sounds have to be grouped 
together into a single diaphone in French, but that at the same 
time the sounds are in the speech of many separable into two 
phonemes. 

Bearing of the Theory of Diaphones on the Theory 
of Phonemes 

624. It has been necessary to deal at some length with the theory 
of diaphones in order to make clear the distinction between dia¬ 
phones and the allophones (members) of phonemes, and to show 
why it is necessary to take as a basis of the definition of the phoneme 
the pronunciation of a single individual speaking in one particular 
and consistent style (§28). 

624a. Some of those interested in the nature of the phoneme 
have attempted to find a place for diaphones (which they generally 

“ In a great number of instances we find, in this type of French, non-final 
syllables containing a front a indistinguishable from the a of final syllables. 
Such are the first syllables of Jase (chasser), frape (frapper) , ale (alter). It 
would seem that the commonest case of "middle a” belonging to the front 
a-phoneme is where r follows. 
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call “free variants”) within the definition of the phoneme. No 
such attempt was made by the initiators of the phoneme theory 
(see Appendix I), nor indeed in my opinion can any such attempt 
possibly succeed. That this is so is evident from several of the 
considerations put forward above, and in particular from the fact 
that the circumstances under which two phones fall into one 
diaphone (§§598 ff.) have nothing in common with the circumstances 
under which two phones may be members (allophones) of a single 
phoneme as originally conceived. 

6246. The impossibility of incorporating the conception of the 
diaphone into the definition of the phoneme is likewise shown by the 
fairly common occurrence of a phone belonging to two diaphones 
(§§610-618), and the comparative rarity of possibilities of assigning 
a sound to one of two or more phonemes. (See, for Japanese, 
§323 ff., for Pekingese, §195, 196; also Y. R. Chao’s The Non¬ 
uniqueness of Phonetic Solutions of Phonetic Systems .) 

625. It is conceivable that some sort of "mentalistic” definition 
of a phoneme could be devised which would include diaphonic 
variants (see Chap. XXIX). It might, for instance, be main¬ 
tained that a phoneme is an “ideal sound” 14 which is "realized” 
in one way by one person under a given set of conditions, and in 
another way by the same person under other conditions and yet 
in other ways by other persons. I am not at present prepared to 
formulate any definition of this nature. Psychologists may in 
due course be able to determine whether any such definition is 
possible. I would only say here that there appear to be difficulties 
in the way. It would be necessary to show, for instance, that 
different uttered sounds are realizations of the same ideal sound 
and not realizations of different ideal sounds. And even if it 
proves possible to do this for one speaker, it may prove difficult 
or impossible to show that an ideal sound of one speaker is identical 
with that of another. 

626. I would conclude this chapter by remarking that the 
idea of the diaphone is of limited application only, and cannot 
like the idea of the phoneme be elaborated into a complete and 
universal theory applicable to all consistent types of speech. This 
is due mainly to the difficulty of deciding the extent to which 
deviations from a type are to be admitted as belonging to “the 

14 Presumably an abstraction of the third or fourth degree. (See my paper 
on Concrete and Abstract Sounds in the Proceedings of the Third International 
Congress of Phonetic Sciences (Ghent, 1938), pp. 4-6.) 
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same language” as that type, and at what points the limits must 
be set beyond which deviating sounds are to be regarded as belonging 
to separate languages or dialects and therefore not to be comprised 
in the diaphones of the language under consideration. 15 

CHAPTER XXVIII 

ERRATIC PRONUNCIATION 

627. It is presumably a fact that a person never pronounces 
a given word twice in precisely the same way, even in the same 
context. 1 He may be said to aim at the same pronunciation, and 
generally produces what may roughly be called “the same” sounds, 
but every utterance doubtless differs from every other in details 
which are usually too minute to detect either by ear or by apparatus. 
With some speakers, however, involuntary variations of sound are 
clearly perceptible to outside observers. Such variations are 
generally found in one of three cases: (1) when the language is one 
containing a relatively small number of phonemes and where 
consequently absolute precision in the pronunciation of certain 
sounds is not essential, (2) when a person speaks with a mixture 
of dialect, (3) when a person has made intentional changes in his 
pronunciation. In all these cases speakers are inconsistent in their 
pronunciation, and use variants apparently at random. 

Variphones 

628. To denote a group of sounds coming under (1) I have 
suggested the term variphone . a Variphones are thus “unstable” 
sounds—sounds which are liable to “variation” independently of 
their phonetic context. 

629. One of the most noteworthy cases of a variphone is “the 
Japanese r.” In the pronunciation of many if not most Japanese 
this “sound" is very variable; they sometimes use a sound resembling 


16 It will be observed that in discussing diaphones the term “a language” 
is used in its common and rather vague sense, and not in the restricted sense 
attributed to it in the definition of the phoneme (§28). 

1 If there is such a thing as " the same context.” 

2 Some critics have objected to this hybrid term, but as far as I know no 
one has yet suggested any other single word which meets the case. H. E. 
Palmer employed the term "free phoneme” (Principles of Romanization, 
p. 55); this, however, has the disadvantage of requiring the use of " contactual 
phoneme” or "bound phoneme” to denote what is here termed simply a 
"phoneme.” 
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an English fricative r (j), sometimes a lingual flap (r), sometimes a 
kind of retroflex d (<J), sometimes a kind of I, and sometimes sounds 
intermediate between these. One and the same speaker will use 
all these forms indiscriminately; he will pronounce, say kore (this) 
as koje, kore, kocje, kole, or with intermediate pronunciation 
without being aware that his pronunciation varies, j, r, c[, I, 
etc., may be called "members” of the variphone. In the absence 
of special training Japanese speakers can neither hear the difference 
between these members nor make any one of them at will. (One 
result of this is the well-known difficulty they have in hearing or 
making the difference between r and 1 when they speak European 
languages.) 

630. Variphones consisting of r-like and l-like sounds would 
also appear to occur in some languages of Africa, for instance, in 
Bambara and Malinke . 3 (The case must be distinguished from the 
use of r-like and l-like sounds as members of the same phoneme. 
See §§81-83.) 

631. The following are some other examples of variphones. 
gg and g form a variphone in the speech of Midland districts of 
England (Birmingham, Chester, etc.). Many English people from 
these districts use the two indiscriminately, both in words such as 
finger, longest (elsewhere 'figga, 'loggist) and in words like singer, 
longing (elsewhere 'siga, Mogig). Without training they cannot 
hear the difference between gg and g, nor can they make the 
difference at will. 

632. In Saxon German, and apparently in some types of South 
German, p, tp and voiced b constitute a variphone. Saxon Germans 
commonly use these sounds indiscriminately; they will pronounce 
Blatt indiscriminately as plat, blat or blat and they will pronounce 
filatt likewise; so also with all the other words written with fi or b. 
t/^/d, and s/j/z, etc., likewise constitute variphones in these types 
of German . 4 

633. In initial position d and 8 appear to form a variphone 
in Spanish, the two sounds being used indiscriminately by many 
Spaniards in this situation. Thus when the word doce is initial 

•Referring to these languages, Westermann and Ward say: "I and r 
are interchangeable in a great many words, dialectally and even with 
individuals.” (Practical Phonetics for Students of African Languages, p. 184.) 

4 I have heard a South German university professor pronounce schon 
sometimes Join and sometimes 30 m, the presence or absence of voice being 
apparently immaterial. 
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it is pronounced both 'do0e and '3o0e; both pronunciations may be 
heard from the same speaker. Similarly with b and (3, g and y. 

634. Variphones consisting of mb and b, nd and d, etc., are 
found in varieties of Greek and in Kikuyu. Many, if not most, 
Greeks will pronounce crapdvTa (forty) indiscriminately as sa'randa 
or sa'rada in ordinary conversation, 5 6 and they will pronounce 
Sfcv £epco (I don’t know) sometimes as deg'gzero and sometimes as 
Se'gzero. It appears that without special training these Greeks 
can neither hear nor make the difference between nd and d, gg 
and g, etc. When learning English without phonetic training they 
can neither hear nor make any difference between band and bad, 
but pronounce both words as band and bad indiscriminately. 

635. In Kikuyu, as has already been observed, mb, nd, etc., 
appear to be felt by the natives to be single sounds, and in initial 
position simple b, d, etc., are sometimes unconsciously substituted 
for them. 6 (Plosive b, d, etc., do not occur as separate phonemes 
in the ordinary form of the language; see §79.) Thus mborl (goat), 
ndere (mortar) are sometimes pronounced [bori], [dere] with 
plosive b and d. 7 

636. Variphones have a place in the phoneme theory. For 
the purpose of that theory the members of a variphone count as 
if they were a single sound, which may itself constitute a phoneme 
or which may be a member of a phoneme. The Japanese r, for 
instance, is a variphone the members of which do not appear to 
be in any way conditioned by phonetic context; it is therefore a 
variphone which itself constitutes a phoneme. 8 On the other 


5 Some Greeks when trying to speak very precisely will use a spelling 
pronunciation sa 1 rants, but this would appear to be unnatural to most 
Greeks and in any case belongs to a different style of speech which we are 
not concerned with here. 

• Armstrong, Phonetic and Tonal Structure of Kikuyu, §90. 

7 The transcriptions [bori], [dere] here are narrow. In broad transcription 
of Kikuyu b, d (without preceding m, n) stand for (3, 3. 

* At least this is what seems to be the case in normal Japanese. Some 
observers, however, have maintained that the members are to some extent 
conditioned by phonetic context, e.g. that a more l-like sound is used before O. 
They maintain in fact that there is a Japanese r-phoneme having two or 
more of the above-mentioned sounds as members. This may be true of 
some types of Japanese, but there certainly appear to exist other types, 
where no such system is found, i.e. where the sounds mentioned in §629 
constitute a true variphone and are not connected by any phonemic relation. 
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hand, in the pronunciation of those Spanish speakers who use d 
and 9 indifferently in initial position, this variphone d/9 is a member 
of their d-phoneme; the members of this phoneme are then d 
(used after n), 9 (used in other medial positions and finally), and 
the variphone d/9 (used initially). 

637. It is worthy of note that sometimes if we take account 
of variphonic forms, we find ourselves obliged to add to the number 
of phonemes in a language. Thus Kikuyu, as normally recorded, 
may be said to contain two labial consonant-phonemes; they may 
be written m and b, but it has to be remembered that the principal 
member of the latter phoneme is p. There exists also mb, which, 
though a ‘ single sound” in the conception of the native, may, 
according to one view (§79), be regarded as a composite of the 
m and b phonemes; ordinary plosive b may be taken as being the 
member of the b-phoneme used when m precedes. But, as has 
already been mentioned (§635), many Kikuyu speakers use a vari¬ 
phone mb/b in initial positions. An analysis of their speech must 
necessarily admit three labial consonant-phonemes, the variphone 
mb/b having to be regarded as a separate phoneme. Fortunately 
this need not complicate the transcription, since the variphone 
may without ambiguity be written mb. The same transcription 
is arrived at if we simply ignore the variant b for initial mb, as 
being unessential to the pronunciation of the language. If, however, 
we were to ignore the other variphonic form mb, the analysis would 
give three phonemes requiring three separate letters in transcription 
(m, p and b). 


Mixture of Dialect 

638. One frequently meets with people whose speech has been 
influenced by two or more environments, and who consequently 
speak with mixed dialect. Their speech often shows irregularities 
and inconsistencies. There are, for instance, people of Scottish 
origin who have lived for a long time in England and who will 
pronounce the vowel in such a word as home sometimes in the 
Scottish way and sometimes in an English way or an approximation 
to an English way; they will sometimes make a difference between 
the vowels in food and good and sometimes not; they will sometimes 
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sound an r in such a word as burn and sometimes not, without 
any apparent system. 9 

639. The following are a few other examples. There are English 
people who pronounce words like slightly, mutton sometimes with a 
glottal stop ('slaiPli, 'mA?n) and sometimes with t ('slaitli, 'mAtn). 
I have known one speaker of Irish origin who sometimes pronounced 
the English 0 in the normal English way and sometimes substituted 
a dental t for it, as is done in Irish dialects of English. French 
people may be met with in Paris who speak with traces of 
Southern accent, e.g. who will sometimes pronounce donner 
with an o-like vowel in place of ordinary Parisian o. Swiss 
speakers of French who have lived for a long time in France 
often lose some of the characteristic features of Swiss French 
pronunciation but occasionally revert to them without knowing it, 
and so on. 

640. It would seem that erratic pronunciation due to mixture 
of dialect is very prevalent in the United States. For example, 
there are Americans who sometimes make a distinction between 
Mary and merry and sometimes not, who are inconsistent in their 
use of a, d and o, of ae, a and a, and so on. 10 


* There is some evidence to show that excitable people who have a mixed 
pronunciation tend under stress of emotion to revert to the pronunciation 
they acquired in the first instance. If this could be proved, it would show a 
sort of system in their manner of speaking; the variations might be found to 
accord in some way with the "context of situation.” The speaker would in 
fact sometimes talk in one "dialect” and sometimes in another. 

10 Examples are to be found in transcriptions of records that have been 
given in various issues of American Speech. For instance, a speaker whose 
pronunciation was recorded in the issue of February, 1941, p. 40, is seen to 
have erratic pronunciation of "stressed er”; he uses three sounds (there 
written with the symbols 3, 3f and a:) indiscriminately, where a consistent 
speaker would use only one. Another speaker whose pronunciation was trans¬ 
cribed on p. 41 of the same issue uses two vowels (there written 3 and ce) 
indiscriminately in similar cases. The "stressed er” of another speaker, a 
specimen of whose pronunciation was given on p. 179 of the issue of October, 
1942, is sometimes a variety of 3^ (there written T) and sometimes a diphthong 
of the type at (there written 31 ). In the issue of October, 1939, a speaker 
was represented as pronouncing the word hole in four different ways: hool, 
hoxol, hoxl and hoxisl. 
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Intentional Changes of Pronunciation 

641. Erratic pronunciations are not unfrequently met with 
which originate in voluntary attempts to alter one’s pronunciation. 
After a time such voluntary changes may become definitely fixed 
so that the new pronunciation comes to be used involuntarily. 
More often, however, the speaker’s pronunciation becomes in the 
end erratic; he sometimes uses the new pronunciation and sometimes 
the old without being aware of the variation in his speech. 

642. Intentional changes of pronunciation are made for a 
variety of reasons. The commonest is presumably where a person 
comes into an environment where some feature of his original 
pronunciation might give rise to misunderstanding or is very 
unusual. Thus, as the sequence 'pasabl means passable in Southern 
England and possible in America, an Englishman permanently 
resident in America might well make intentional adjustments in 
his manner of saying these words. If he did so, it is likely that at 
times he would unintentionally revert to his original English 
pronunciation. 

643. People not unfrequently change or attempt to change 
their pronunciation purposely for other reasons. Sometimes they 
believe (because they have been so taught) that some other pro¬ 
nunciation is "better” than what they acquired naturally in 
childhood; sometimes they try to make their speech correspond 
more nearly to current spelling; sometimes they imitate the speech 
of others whom for some reason they consider worthy of imitation; 
sometimes they adopt or try to adopt the recommendations of 
some dictionary; sometimes they follow or try to follow some 
tradition. I have even known a man alter his pronunciation 
purposely so as to make his language more consistent for a pedago¬ 
gical reason. In all such cases erratic pronunciation is liable to 
result; the use of the new pronunciation does not always become 
absolutely habitual, but the speaker reverts from time to time 
to his "natural” pronunciation. 

644. An example not unfrequently met with in France is the 
case of people whose pronunciation of such words as miler, aimable 
varies, people who pronounce the first syllable sometimes with an 
open £ and sometimes with a “middle e.” Those who have this 
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erratic pronunciation are doubtless people whose "natural” pro¬ 
nunciation has middle or close e but who have either been taught 
that e is more “correct” or are influenced by orthography. 

645. Tradition is no doubt responsible for the erratic pronuncia¬ 
tion of r which may be observed very frequently on the French 
stage. Many actors when speaking in declamatory style employ 
lingual r and uvular r indiscriminately. Presumably in their 
“natural” pronunciation they use uvular r, but there is a tradition 
that lingual r is "correct” in declamatory style, and they aim at 
using it but often fail to do so. 11 

646. It seems probable that the use of a sound of the quality 
of the British English short o (d) in American English is "un¬ 
natural,” and is consciously acquired by those who use it; .hence 
its lack of stability. 12 The same no doubt applies to the variety 
of "middle a” (between the American a (ae) and a) which is said to 
be used by some Americans in such words as ask, path. 

647. It has been necessary here to make these references to 
mixed dialect and intentional changes on account of their fairly 
frequent occurrence. It is evident, however, that erratic pronuncia¬ 
tions other than those found in variphones have no place in the 
phoneme theory. The speech of those whose pronunciation is 
unstable cannot be reduced to phonemes at all, unless the instability 
is due to the existence of a variphone. 

648. It may be added that there may possibly exist variphones 
which have originated from intentional or unintentional changes 
of pronunciation. Possibly, for instance, t and ? may be considered 
to form a variphone in the speech of some of those who use them 
indiscriminately in such words as slightly, fortnight as mentioned 
in §639, although this erratic pronunciation almost certainly arises 
either from mixture of dialect or from spelling-pronunciation. 


11 This may be observed for instance in Sarah Bernhardt's gramophone 
record of a passage from Rostand’s La Samaritaine (His Master’s Voice record 
No. G.C.3H71, II), an analysis of which was given in my Intonation Curves, 
pp. 31-37. 

12 D “ cannot be considered a stable and well recognized phoneme in General 
American” (Kenyon, American Pronunciation, tenth edition, p. 185). 
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CHAPTER XXIX 

MENTALISTIC AND FUNCTIONAL CONCEPTIONS 
OF THE PHONEME 


649. The conception of the phoneme put forward in this book 
may be described as a ‘‘physical’' one, since the phoneme has been 
treated as “a family of sounds.” 1 It is only right, however, to 
mention that other conceptions of the phoneme exist and, in 
particular, a conception which may be termed "mentalistic,” and 
another which may be termed “functional" or “structural.” Not 
being a psychologist, I am not in a position to expound a mentalistic 
view in a manner that would satisfy experts in psychology. All 
I can do is to give a rough idea of what appears to me to be meant 
by those who take a mentalistic view of the phoneme, in the hope 
that readers who have had psychological training may investigate 
its merits. 

650. The idea of those who hold a mentalistic view would appear 
to be in essence that a phoneme is a single "abstract sound” 2 or 
"sound image,” which the speaker is capable of bringing to mind 
at will, and which he (subconsciously) endeavours to manifest 
concretely when he speaks. 3 In some phonetic contexts he 


1 “Abstract” sounds of the first degree. See my paper on Concrete and 
Abstract Sounds in the Proceedings of the Third International Congress of 
Phonetic Sciences (Ghent, 1938). 

2 Presumably of the second degree. 

3 A critical examination of this statement may show that, like many other 
statements concerning fundamental concepts, it implies a vicious circle. I 
am not sure that it is possible to give any "definition” of a phoneme, other 
than a “physical” one, which does not contain an implicit reference to the 
thing to be defined. This does not mean that we should not do our best 
to arrive at definitions of fundamental concepts. (See my article Some 
Thoughts on the Phoneme in the Transactions of the Philological Society, 1944.) 
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succeeds in giving accurate concrete expression to the "sound 
image," but in other phonetic contexts he fails to do so and, 
utters a sound resembling though not identical with the sound 
aimed at. 4 

651. To give an example, it may be supposed that a particular 
speaker of English has a conception of an "abstract” or “ideal” 
t-sound. He brings this sound accurately into concrete manu- 
festation when he says such a word as ten, but he utters various 
approximations to this in other phonetic contexts, e.g. in trap, 
eit9, 'lets, 'setl, 'bAtn. Similarly he may have a conception of 
an "ideal" short a, which appears accurately when he says kat, 
tap, and many other words; when he says such a word as 'alfabit 
[alphabet) he may be considered to aim at producing the same 
sound, but the vowel which actually emerges is somewhat different. 5 

652. Professor Baudouin de Courtenay, 6 the first to formulate 
the phoneme theory, 7 took a “psychological" view of the phoneme. 
He defined phonemes as “mental images,” 8 and accordingly 
distinguished two kinds of phonetics which he called “physio- 
phonetics” and “psychophonetics” respectively. He applied the 
term “physiophonetics” to the study of sounds actually uttered, 
and used the term ‘'psychophonetics” to denote the study of the 
"mental images” which uttered sounds are intended to represent. 
He likewise distinguished two kinds of phonetic writing which he 


4 According to this theory the sound-image would appear to be in perpetual 

existence in some non-material way. For it is presumably not possible to 
bring to mind at will a non-existent thing. 

6 The differences between the varieties of short a are well heard if one 
whispers words containing this phoneme. In whispered speech the resonance 
pitches of the vowels become audible, and it is found for instance that the 
front resonance pitch of the a in. 'alfabit is lower than that of the commoner 
short a of tap, etc. 

• 1845-1929. 

7 See Appendix I. 

* See Z. M. Arend, Baudouin de Courtenay and the Phoneme Idea, in Le 
Maitre Phondtique, January, 1934. 
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termed “physiophonic” transcription and “psychophonic” trans¬ 
cription, the first being a symbolization in writing of sounds actually 
uttered, and the second being a symbolization of the phonemes, 
i.e. his “mental images” or sounds which the speaker supposedly 
aims at uttering. These terms correspond in the main to the 
"narrow” and "broad” of the English phoneticians—terms originally 
invented by Henry Sweet 9 and used by him in his Handbook of 
Phonetics (Oxford, 1877). 10 

653. Among others who have held a mentalistic view of the 
phoneme was the eminent linguistician E. Sapir. His opinions 
were ably set out by him in an article Sound Patterns in Language 
which appeared in Language, Vol. I, 1925, pp. 37-51, and in an 
article La Realite Psychologique des Phonemes in the Journal de 
Psychologie, January-April, 1933. These articles should be closely 
studied by all who interest themselves in the theory of the 
phoneme. In Sound Patterns in Language he used the term “ideal 
sounds” apparently to mean phonemes (regarded mentalistically), 
and expressed the view that “ ‘ideal sounds,’ which are constructed 
from one’s intuitive feeling of the significant relations between 
the objective sounds, are more ‘real’ to a naive speaker than the 
objective sounds themselves.” In the same article he said that 
"a complex psychology of association and pattern is implicit in the 
utterance of the simplest consonant or vowel.” And again, "A 
‘place’ is intuitively found for a sound (which is here thought of 
as a true ‘point in the pattern,’ 11 not a mere conditional variant 12 ) 
in such a system because of a general feeling of its phonetic relation 
ships to all other sounds” (p. 47). And again at the end of the 
article: "The whole aim and spirit of this paper has been to show 
that phonetic phenomena are not physical phenomena per se, 
however necessary in the preliminary stages of inductive linguistic 
research it may be to get at the phonetic facts by way of their 
physical embodiment. The present discussion is really a special 


• 1846-1912. 

10 Sweet's investigations in the field of phonetics likewise began about 
1870. He did not treat the subject on psychological lines, and his researches 
were carried on independently of those of Baudouin de Courtenay. It 
is therefore interesting to find that these two great pioneers arrived at similar 
practical conclusions concerning phonetic transcription. 

11 Meaning presumably a principal member of a phoneme. 

l * Meaning presumably a subsidiary member of a phoneme. 
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illustration of the necessity of getting behind the sense data of any 
type of expression in order to grasp the intuitively felt and com¬ 
municated forms which alone give significance to such expression.” 

654. There are authorities who reject the mentalistic conception 
of the phoneme, but who at the same time maintain that the 
phoneme cannot be described in terms of the sounds by which 
it is manifested. They define the phoneme by reference to its 
function in the structure of language. 

655. Foremost among these was Trubetzkoy, 13 for whose 
opinions I have great respect. Trubetzkoy appears* to have 
regarded phonemes as the minimum segmental elements of language 
which are capable, by substitution, of distinguishing one word from 
another. He defined them as “phonological entities which from 
the standpoint of the language under consideration cannot be sub¬ 
divided into smaller consecutive entities,” 14 and expressed the view 
that they are “differentiating signs,” 15 and can only be defined 
by reference to their function in the structure of each language. 16 

656. Trubetzkoy also said that a phoneme is "the sum of 
the phonologically relevant features of a ‘Lautgebilde’.” 17 The 
meaning he attached to the term "Lautgebilde” is not clear to me. 
He evidently did not mean by it “sound-image” in the mentalistic 
sense, for he expressly stated that “the phoneme is a linguistic 
and not a psychological idea.” 18 Trubetzkoy was far too able 
a thinker to use his technical terms loosely, and I believe he was 
perfectly clear in his own mind as to what he meant when he used 
such terms as "phoneme” and "Lautgebilde.” But, owing to the 
deficiencies of language as a medium of expression, 19 1, and perhaps 


13 1890-1938. 

14 N. S. Trubetzkoy, Grundziige der Phonologie, p. 34 (now available in a 
French translation by J. Cantineau, bottom of p. 37). 

16 Ibid., p. 41 (Cantineau, p. 46). 

16 Ibid., p. 39 (Cantineau, p. 44). 

17 Ibid., p. 35 (Cantineau, p. 39). 

18 Ibid., p. 37. At times Trubetzkoy employed the terms "Lautabsicht” 
(sound-intention) and "Lautvorstellung’' (sound-image), but eventually 
rejected them as inadequate ( Grundziige der Phonologie, p. 37, Cantineau, 
p. 42). 

“"Speech, indeed, is nothing more than a series of rough hints which the 
hearer must interpret in order to arrive at the meaning which the speaker 
wishes to convey” (L. R. Palmer, Introduction to Modern Linguistics, p. 82). 
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others, have not as yet fully understood his meaning. It is satis¬ 
factory, however, to find that Trubetzkoy’s view leads to the 
same practical conclusions as other formulations of the theory, 
namely, that it provides the essential materials for the analysis of 
language structure and a sound basis for phonetic writing. 

657. Bloomfield appears to have envisaged the phoneme theory 
up to a point in much the same way as Trubetzkoy. He defined 
phonemes as “minimum units of distinctive sound-feature,” 20 and 
as "the smallest units which make a difference in meaning.” 21 He 
has said too that “the phonemes of a language are not sounds, but 
merely features of sound which the speakers have been trained to 
produce and recognize in the current of actual speech-sound.” 22 
W. F. Twaddell, on the other hand, has expressed the view that 
phonemes have no real existence either “physically” or "mentally,” 
but are merely "abstractional fictitious units.” 22 This appears to 
be also the view of L. Hjelmslev and the Danish school. According 
to them it would seem to be immaterial whether languages are 
pronounced accurately or not, or indeed, whether any pronunciation 
is given to the phonemes at all. It is not necessary here to enter 
upon any discussion of the validity of such opinions, since such 
discussion would inevitably resolve itself into a discussion of the 
reality or unreality of things in general and of "ideas” in particular. 
It is sufficient for the present purpose to remark that these views 
though neither physical nor mentalistic would seem to lead to the 
same practical conclusions as the other views referred to. 

658. It may be added that I find all the attempted definitions 
of the phoneme to be unsatisfactory on account of the apparent 
impossibility of knowing precisely what is meant by "a language.” 


*° L. Bloomfield, Language, p. 79. 

“ Ibid., p. 136. 

** On the other hand Bloomfield, very unfortunately in my opinion, 
introduced the term “secondary phoneme” to denote any sound-attribute 
(suprasegmental feature) which serves to make a minimal distinction between 
one word and another. His numerous followers in America have ceased to 
use his qualifying word "secondary,” and the result has been that the term 
“phoneme” has acquired in America a meaning quite different from that 
intended by the European pioneers. See Appendix I. 

as W. F. Twaddell, On defining the Phoneme (Language Monographs, 
published by the Linguistic Society of America, No. XVI, 1935). 
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The term “a language” is a vague one with psychological or super¬ 
physical connotations. It may presumably be said to mean the 
total material upon which a person can draw when he speaks. 
But where is that material? It surely “exists” only in some 
mentalistic or non-material sense. In fact it looks as if it were 
one of the numerous primary notions which cannot be defined, 
but upon which we nevertheless erect sciences. 

659. It should be pointed out too that phonemes can hardly 
be said to "exist” if sounds can be "assigned” to them in the manner 
suggested in various places in this book. If a phoneme has any 
sort of superphysical “existence,” it presumably follows that all 
concrete manifestations of it are predetermined, so that there can 
never be any doubt as to the phoneme to which any particular 
sound in a given setting belongs. A possible alternative view is 
that most phonemes have a superphysical “existence,” but that 
there are exceptional cases of phonemes of an indefinite or transitional 
nature. The difficulties of reconciling any synchronic theory of 
phonemes with many of the diachronic phenomena observable in 
the history of languages lend some support to this possibility. 
(See Chap. XXXII.) 

660. However that may be, I submit that it is incumbent on 
us to keep an open mind on all these matters, and to give special 
attention to the possibility that the phoneme may after all prove 
to have an "existence” of some kind, which may become evident 
in a remote future, if, as is thought by some, evolution brings with 
it a further development of men’s faculties, giving them a clearer 
perception than most of us at present possess of a fourth dimension 
of space or a second dimension of time. 24 

CHAPTER XXX 

THE PRACTICAL USE OF THE THEORY OF 
PHONEMES 

661. I take it as axiomatic that every man is endowed with 
certain abilities which he can use for the general good. People 
with ability for applied phonetics have in a particularly high degree 
the capacity for doing “useful" work—by which term I mean 


24 See Concrete and Abstract Sounds, especially p. 6. 
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work “conducive to the ultimate well-being of humanity.” In 
contrast with workers in some other scientific fields, they can 
keep in view throughout their investigations a definite humanistic 
object towards which they can direct their attention, namely, the 
improvement of the means of oral and written communication 
between man and man. The findings of phonetic science give 
people skill in communicating effectively with each other. 1 

662. It is because phonemes are the basic elements of com¬ 
munication by language that an investigation of their nature 
and their potentialities is “useful” in the above sense. 

663. Phonemes are the basic elements in that they cannot help 
constituting indivisible semantic units of which languages are com¬ 
posed. They are the elements which, when put into sequences (and 
in some languages supplemented by other significant features), 
make larger semantic entities, namely, morphemes, 2 words and 
sentences. The main uses of the theory of these elements are 
outlined in the following paragraphs. 

664. Firstly, it must be observed that the phoneme, as described 
in this book, is a conception of a phonetic nature, and the formula¬ 
tion of the phonemic structure of a language is one of the aids to 
acquiring a good pronunciation by the learner of that language. 
By means of phonetics he learns to utter foreign sounds. By 
memorizing, or with the aid of phonetic transcription (see §§669 ff.), 
he learns how “to use the right sound in the right place”—which 
he can do, if desired, before concerning himself with meanings. 
From formulations of phonemic structure he learns how “sounds 
are modified according to context” in the language he is learning, 
and how the principles of such “modification” differ from those 
followed in his native language. 

666 . Secondly, it is the phonemic idea that forms the basis for the 
non-phonetic branches of linguistic science, i.e. semantics, morpho¬ 
logy, grammar, etc. These branches are concerned with the order 
in which phonemes are placed in constructing the larger semantic 


1 Henry Sweet pointed out years ago that phonetics is an art as well as 
a science (Practical Study of Languages, p. 4). 

a The units of form, such as stems, prefixes, suffixes. See Trubetzkoy, 
Grundztige der Phonologie, p. 225 (Cantineau’s translation, p. 270). 
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entities, with the additions, subtractions and substitutions of 
phonemes made for different purposes, and with the incidence of the 
other semantic elements (if any). Phonemes in fact lie at the root of 
everything that is required for enabling language learners to use the 
right words, to put the words into their various forms, and to use 
the forms appropriately. In fact, all practical linguistic attainments 
may be said to depend ultimately on the theory of phonemes. 

666 . Thirdly, the phoneme theory has a particularly important 
use in connexion with the construction of systems of writing. The 
analysis of a language into its constituent phonemes furnishes us 
with the means of writing it in the simplest manner that is con¬ 
sistent with avoiding ambiguity. That the phoneme theory can 
render this service has already been demonstrated in preceding 
chapters, but it will be well to give in a separate chapter (Chap. 
XXXI) a summary account of the principles involved together 
with a few examples in illustration. 

CHAPTER XXXI 

FURTHER REMARKS ON PHONETIC WRITING 

667. The assignment of one letter 1 to each phoneme of a language, 
together with suitable marking of any other significant sound- 
attributes, is both necessary and sufficient for representing every 
“word” of the language unambiguously in writing. The use of as 
many letters as there are phonemes is necessary because the use 
of fewer letters would cause words of different pronunciation to be 
written alike. It is sufficient, since to employ a greater number 
of letters would mean either writing a sound in more than one 
way or exhibiting distinctions of pronunciation which the native 
speaker is as a rule unaware of, and which are of no importance 
from his point of view. In either case the superfluous letters 
would not serve to distinguish words from each other or help to 
specify what words were meant. 2 

1 "Letter” is to be taken in these paragraphs to include digraphs when 
they are appropriate. 

2 Except when it may be advisable to introduce unpronounced letters 
into an orthography with a view to making written distinctions between 
■words of identical pronunciation (homophones). (See §§704, 705.) 
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668 . Two kinds of simple writing can be established with the 
aid of the phoneme theory, namely, phonetic transcription and 
orthography. 


Phonetic Transcription 

669. Phonetic transcription is a style of writing in which the 
principle of representing each phoneme by a particular letter (or 
sometimes more than one letter if the transcript is “narrow”) is 
strictly adhered to not only for isolated words but also for con¬ 
nected sentences. Its chief objects are (1) to put on record the 
phonemic make-up of languages, and (2) to facilitate the acquisition 
of a good pronunciation of foreign languages, by showing the learner 
which of the sounds of the language are the appropriate ones to 
use in any given word or sentence. The simplest form of trans¬ 
cription adequate for this purpose is that which records the language 
by means of one letter for each phoneme, i.e. what is called a 
“phonemic” or “linguistically broad transcription.” When the 
student of the language has learnt to utter correctly the various 
members of each phoneme and knows the conventions governing 
the use of the members, the broad transcription is an infallible guide 
for indicating to him which sounds are to be used in any particular 
case. The transcription must be supplemented by marking any 
other significant attributes (e.g. lengths, stresses or voice-pitches) 
that may occur in the language. Such a form of transcription 
shows in writing the structure of the language to be learnt. 

670. It is sometimes convenient in the practical teaching of 
languages to use transcriptions which are to some extent “narrowed,” 
by the addition of letters or marks representing special members 
of phonemes. They are called “allophonic” or “linguistically 
narrow” transcriptions. The idea of using such forms of trans¬ 
cription is to help the student of a foreign language to learn the 
rules governing the use of particular members of phonemes, rules 
which he must learn sooner or later if he wishes to pronounce well. 
In my experience such aid is generally unnecessary, and it has the 
disadvantage of making the transcription needlessly complex. 
When the rules governing the use of particular members of phonemes 
are simple, as is usually the case, they should be formulated by the 
teacher and learnt by the pupil. Thus it is in my experience 
just as easy for the learner of English to put into practice the 
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instruction “You must give I the dark sound whenever it is final 
or followed by a consonant” as to learn that “in transcribing 
English the letter ! is never written finally and before consonants; 
its place is taken by +.” Likewise it is just as easy for the English 
student of French to learn when the letter y stands for the con¬ 
sonantal value in broad transcription as to learn when he should 
write y (or the special letter i|) in narrow transcription. 

671. When, however, the rules governing the use of special 
members of phonemes are involved and difficult to remember, the 
use of narrowed transcriptions can be justified as a help to learning 
these rules, though I do not think the use of such transcripts is 
indispensable. If, for instance, I were called upon to teach the 
pronunciation of a type of German in which x and 5 belonged to 
a single phoneme, 3 ! should feel disposed to narrow the transcription 
by using the two symbols [x] and [ 5 ]. I should do this on the 
ground that [ 5 ] is very unlike [x] in sound and that the principles 
governing the use of the two sounds are rather involved . 4 I believe, 
however, that good results could also be obtained by using the letter 
x only, provided that hyphens are inserted where necessary to show 
whether a sound begins or ends a syllable. 

672. For a similar reason there is something to be said in favour 
of representing the short weakly stressed German e-sounds^ by a 
in phonetic texts intended for learners of that language, and of 
using y or ij to denote the consonantal y in teaching French. 

673. In a more complicated case, such as the use of the numerous 
variants of a in Arabic, the representation of the main variants 
by special symbols is certainly justifiable. Gairdner indicated 
three and sometimes four members of the a-phoneme in his books , 5 6 
this being one way of helping students of spoken Arabic with the 
difficult task of learning when to use the variants. But the end 
is achieved at the cost of legibility of the texts. If this method is 
used, I consider it desirable to pass on to broadly transcribed, 
and therefore more easily legible, texts as soon as the learner has 

3 I.e. a type in which Wacholder and similar words are pronounced with h 

following the x as mentioned on p. 69, footnote 37. 

1 See §230. 

6 W. H. T. Gairdner, The Phonetics of Arabic and Egyptian Colloquial 
Arabic. Also Elder, Egyptian Arabic Phonetic Reader. 
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a fairly clear idea of the situations in which the variant a’s are 
used. 6 

674. Similarly a phonetic notation distinguishing perhaps three 
members of the e-phoneme of Pekingese might be found useful 
to some learners of that language in early stages, on account of the 
difficulty of remembering the circumstances under which each is 
used. (The sound is conditioned, not only by tone, as mentioned 
in §112, but also by the sounds preceding and following. It is, 
for instance, e-like when preceded by i, but A-like when preceded 
by various consonants, such as f, k. And the A-like quality varies 
according as the sound is final or followed by q or n (§503).) And 
if Catford’s description of the phonemic structure of the Kabardian 
language of the Caucasus is correct, 7 it would be most difficult 
to learn the principles governing the use of the vowel variants 
without some help in the shape of a narrow transcription of the 
vowels in connected texts. 

675. It is probable that a certain narrowing of transcription 
would be useful in teaching English learners to make proper use 
of vowel harmony in Somali. (See Armstrong, Phonetic Structure 
of Somali, p. 149.) 

676. When narrow transcriptions are used for helping learners 
to remember the conditions under which various members of 
phonemes are employed, it is desirable from time to time to test 
their understanding of the principles by prescribing as an exercise 
the rewriting in broad transcription of a passage written narrowly. 
Later the learners may with benefit reverse the process and practise 
converting broad transcriptions into narrower ones. 

677. It should be noted that narrowings of transcriptions for 
learners of any particular language may differ according to the 
nationality of the learner. 

678. It is sometimes helpful to introduce narrowings into 
phonetic transcriptions intended for use in comparative work, and 
particularly in the comparison of different forms of a language. 
For instance, when teaching the history of English pronunciation, 

8 It is now thought by some that there are two a phonemes in Cairene 
Arabic. This possibility furnishes an additional reason for narrowing the 
transcription. 

7 J. C. Catford, Specimen of Kabardian with Notes on the Pronunciation, 
in Le Maitre Phonetique, July-December, 1942. 
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I find it convenient to narrow the transcription of present-day 
English by making use of the letters e and o. The reason is that 
older forms of English (Chaucerian, for example) are believed to 
have contained long e: and d: as well as long e: and or. Con¬ 
sequently in transcribing such older forms the short vowels 
corresponding to e: and d: must necessarily be written with the 
letters e and d. And as these short sounds were no doubt much 
the same as those used to-day in the same words, it is convenient 
to use these letters in modern English texts transcribed for purposes 
of comparison. (But in a description of modern Southern English— 
and particularly in transcriptions designed for teaching Southern 
English to foreign learners—the more familiar letters e and o 
can well be used in place of e and a.) 

679. For a similar reason I feel that Gairdner did well to 
use two signs for the glottal stop (? and p) in his transcriptions 
of the colloquial Arabic of Cairo, since it seems hardly possible 
for any learner of colloquial Arabic to ignore completely the classical 
(in which the p is pronounced as q). 

680. The following is an example of a different kind. In writing 
the Sotho language broadly the letter I serves to represent the d 
used in that language whenever i or u follows, since the sound I 
never occurs in that position: the two sounds d and I belong to a 
single phoneme. In the allied language Pedi there is no d, but 
there is the peculiar d-like I (referred to in §§81,127) and an ordinary 
I, both of which occur before i and u; the sounds occur therefore as 
separate phonemes, and a special letter (j or d) must be assigned to 
the d-like I. It may therefore well happen that a writer comparing 
the Sotho and Pedi languages would find it convenient to denote 
the d-sound by d instead of I in Sotho words. This is in fact what 
A. N. Tucker has done in his Comparative Study of the Suto-Chuana 
Group of Bantu Languages ; see particularly p. 49 of that book 
(first edition). 

681. Forms of transcription containing special letters to show 
that particular sounds of a language differ from analogous sounds 
of some other language are called “comparative” or “typographically 
narrow” transcriptions. It is customary, and perhaps necessary, 
to use both linguistically and typographically narrow transcriptions 
in works dealing with the comparison of existing dialects, and 
particularly in dialect atlases. In transcribing connected dialect 
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texts, however, it is possible and in my view advisable in the interests 
of legibility and clear exposition to broaden the transcription and 
to make it unambiguous by prefixing to the transcribed texts of each 
dialect a summary of the conventions to be observed in reading them. 

682. Some teachers have advocated the use of “typographically 
narrow” transcriptions in texts designed for language learners. It 
has been proposed, for instance, that the English vowel in red 
should be represented by a symbol other than e or e on the ground 
that it is intermediate in quality between the two French sounds 
generally transcribed with e and e, that the English vowel in hot 
should be represented by some letter other than o or a since these 
are used to denote other shades of vowel sound in French, that a 
special symbol should be used to denote the English r-sound since 
this English consonant differs from the Italian r. Long teaching 
experience has led me to the conclusion that it is as a rule inadvisable 
to introduce letters for such reasons in transcripts for language 
learners. The additional letters complicate the texts, and do not 
in my experience facilitate the task of the pupil except in very 
unusual cases. (See further my article Types of Phonetic Trans¬ 
cription, in Appendix A to the 8th edition, 1956, of my Outline of 
English Phonetics.) 

683. There is no end to the number of symbols that might be 
introduced on these grounds. For instance, if special letters were 
adopted for the above-mentioned English sounds, an equally good 
case could be made out for representing the English consonant t 
by a specially designed letter to show that it differs from the t of 
Spanish or Hindustani, for using different letters for the 9-sounds 
of English and Danish because the sounds are not quite the same, 
or for representing the Japanese u by ui on the ground that it has 
less lip-rounding than the European u-sounds, or for denoting it 
by some specially designed letter in order to remind readers that 
it is not a completely unrounded u. 

684. The right solution of the problem of international phonetic 
transcription for language learners is, I believe, that the number 
of letters should be as small as practicable, and that each letter 
should have an elastic value. Among the values which each letter 
can represent is naturally a “cardinal” one. But it should, in my 
opinion, be a recognized principle that when a letter is not needed 
to denote a “cardinal” sound, it may be used for representing other 
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sounds having reasonably near relationships to that cardinal sound. 
And cases may occur where it is convenient to assign to a given letter 
other values than the cardinal whether or not the cardinal sound 
occurs as a member of a phoneme of the language to be transcribed. 

685. Stated in other terms the principle amounts to this: a 
particular letter cannot have precisely the same value in all the 
languages in which it is used; the learner of any particular language 
must familiarize himself with the value (or values when the phoneme 
has more than one member) assigned to it in that language. 

686 . But whether use is made of "comparative” symbols or not, 
the fundamental basis of transcription for language learners is and 
must be a representation of phonemes. 

687. Here may be mentioned the fact, well known to linguisti- 
cians, that when one is making a phonetic investigation of a language 
not previously analysed, it is necessary to start by noting down 
all the sounds one hears, using therefore an “impressionistic” trans¬ 
cription which may ultimately prove to be very narrow. In the 
initial stages of such work the observer should give no thought to 
possible groupings into phonemes, and above all he should endeavour 
as far as possible to rid himself of his natural tendency to think of 
any of the sounds as groupable into phonemes in the same manner 
as similar sounds of his own language. It is often only after writing 
out numerous texts in very narrow transcription that the proper 
phonemic groupings emerge, and that the investigator can in 
consequence devise the simplest way of writing the language 
phonetically. Good examples of the method of procedure are given in 
Sapir’s article La realite psychologique des phonemes mentioned in §653. 

688 . I have emphasized in §34 and elsewhere 8 that phonemic 
analysis can, in my opinion, only be based on "words” and not 
on connected speech. In order to ascertain the phonetic structure 
of a language the investigator must therefore know what sound- 
sequences are to be considered to be “words.” He must be familiar 
not only with ordinary word-division, but also with the divisions 
of compound words and possibly even to word-like prefixes and 
suffixes. If he is concerning himself with a language which has 
not yet been reduced to words, it is necessary for him to settle 
for himself what the "words” are. 


8 E.g. Some Thoughts on the Phoneme, pp. 127-132. 
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689. Broad (phonemic) transcripts should, in my view, show 
word divisions and (by hyphens) the elements of compound words 
and any word-like prefixes and suffixes, except when the divisions 
are self-evident. For without these indications transcripts might 
be ambiguous. This applies even to a language like French where 
a syllable often begins with the final consonant of a word which is 
closely linked to the initial vowel of a following word. See my 
article The Hyphen as a Phonetic Sign (Zeitschrift fur Phonetik, IX, 
2, 1956). 

690. One further general observation on transcription is needed 
here. Though broad transcription of a language is unambiguous 
in the sense that it records every word of the language in such a 
way that all words of different pronunciation are kept distinct 
in writing—so that all words are readily recognizable and the reader 
can attach the dictionary meanings to them—it gives no indication 
of the numberless other shades of meanings which in oral inter¬ 
course are continually superimposed on these meanings by sentence 
intonation, special voice qualities, special lengths and stresses, 
gestures and so on. Rough hints of some of such superimposed 
meanings may be suggested in writing by punctuation marks, 
italic print, underlining, etc.; but such devices are at best of very 
limited application, and they very often do not show at all adequately 
to the foreign learner how he should utter the words so as to suggest 
the desired meanings. A good deal of the necessary information 
may be given to him partly by rules and partly by special markings 
of length, stress and intonation incorporated in or placed next 
to the transcription. They cannot be regarded as a "narrowing” 
of it, since they apply to the sentence only and are not concerned 
with the differentiation of meanings of individual words. 

Orthography 

691. The second kind of writing, orthography, is what is needed 
for ordinary current intercourse in writing. This too, if it is 
alphabetic, should have the principle "one letter per phoneme” 
as its basis, but it generally has to differ in several ways from a 
transcription for language learners. The chief points of difference 
are set out in the following paragraphs. 

692. A transcription for those wishing to learn to speak a 
foreign language, must record the pronunciation to be actually 
used in each sentence transcribed. It must take account of the 
fact that in sentences words often have pronunciations different 
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from those they have in isolation. Words may, for instance, have 
“weak forms” as well as “strong forms” 9 ; they may have special 
pronunciations in the sentence due to assimilations 10 or elisions 11 
or coalescences 12 ; or again, there may be special phenomena such 
as liaison in French 13 or doubling of initial consonants in Italian. 14 

693. In an orthography, on the other hand, it is not as a rule 
advisable to show variant pronunciations of such types. What 
is needful in an orthography is that every “word” should have a 
definite and easily recognizable visual form. Subject to rare 
exceptions, each word should be written in one way only, and 
its orthographic form should in most cases be based on the 
pronunciation it has when said in isolation. 15 To alter the spelling 

* Such as the two pronunciations of at in 'wot 3 ju: 1 1UkiQ at? (What are 
you looking at?) and 'steiig 3t 'houm (staying at home). (See the sections 
on weak forms ("gradation") in books on the phonetics of English, e.g. my 
Outline of English Phonetics, Chapter XVI.) 

10 Such as the substitution of faif for faiv (five) in the compound 'faifpsns 
(fivepence), or in French the common substitution of beg for btk (bee) in 
beg d3 ga:z (bee de gaz), or in Italian the substitution of m for n in such an 
expression as um poko (un poco, a little). 

11 Such as the elision of the t of nekst (next) in neks taim (next time), 
or in French the elision of the 3 of da (de) in such an expression as pa d daje 
(pas de danger), or in Kikuyu the elision of the final a of araata in araata 
cega (good friends) which is pronounced ara:t Elga [•“_ ]. 

12 Such as the use of E in place of final a + initial e in Kikuyu, e.g. 
njerEiro [ ■ - '] (this road), which stands for njera enD [•••']• (See 
Armstrong, Phonetic and Tonal Structure of Kikuyu, Chapter III.) 

18 As shown, for instance, in le vil (les villes) but lez Dm (les homnies). 
(See Passy, Les Sons du Fran(ais, §§244-249; Armstrong, Phonetics of French, 
Chapter XIX; Stephan and Jones, Colloquial French for the English, Chapter 
LII; and other books on the phonetics of French ) 

14 As shown, for instance, in da kki (da chi, from whom) as compared 
with d i ki (di chi, of whom). (See A. Camilli, Pronuncia e grafia dell' italiano, 
§§89-95. Also A. Camilli, I rafforzamenti inizziali in italiano, in Le Maitre 
Phondtique, July, 1909 and May, 1911.) 

16 Words susceptible to liaison in French would be exceptions. The liaison 
forms are. in most words, the only convenient pronunciations to take as a 
basis for orthography. Such words as les, beaux, haut, hautes, alter, allez, 
allais, allait, allaient, would appear in an international orthography as lez, 
boz, hot, hotez, aler, alez, alez, alet, alet, with the convention that the final 
consonant is silent in speech except when liaison is used. To write non¬ 
liaison forms in an orthography would lead to much confusion on account 
of the variability in the use of liaison in speech. It would also add con¬ 
siderably to the number of homographs. 
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in order to conform to changes of pronunciation in connected 
speech would merely render the written words less easy to recognize. 
Spellings representing special pronunciations heard only in connected 
speech might often necessitate a mental translation into the sounds 
of the spoken language before the reader could recognize what 
words they were intended to represent; often too they would look 
like other words. Reading would in fact be hampered by the 
use of such variant spellings. 

694. The point may be illustrated by an extreme case. A 
man may say tem mins sem in the course of conversation, and 
his hearers will understand his meaning just as well as if he pro¬ 
nounced the expression in a precise manner. But if such a form 
as tem mins sem were used in current writing, it would merely 
constitute a puzzle which the reader might or might not decipher 
easily. 16 

695. An orthography differs also from a phonetic transcription 
for language learners in that it generally has to serve for a wide 
range of speakers with varying accents. The results are twofold. 
Firstly, some of the letters or combinations of letters have different 
significations with different speakers. Thus, if ei were chosen for 
writing the vowel element of such words as day, plate, its phonetic 
meaning would be various types of ei with some speakers, a pure 
long e: with many in Scotland and the North, varieties of ei and 
ai in London, and so forth. Secondly, where local or other variants 
are a matter not of sound formation but of sound distribution, 
some pronunciations have to be ignored, and orthographic forms 
adopted which are phonetic for some speakers though not for 
others. 

696. The following are examples of the latter case. In a 
reformed spelling of English, mark and farm would have to be 
written as they are now, since this spelling accords with the pro¬ 
nunciation of a majority of those whose mother tongue is English. 
It is not phonetic for me and for many others in the South of 
England, who pronounce ma:k, fa:m with precisely the same sound 
as la:f [laugh), ka:m [calm), etc. We could not, however, expect 

18 The appropriate writing in an international phonetic orthography would 
be ten minits lu seven. The spellings of the last two words would be invariable 
and would take no account of the fact that writers might pronounce tU or 
ta or t, 'sevan or 'sevn or 'sebm or sem. 
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the majority of the English-speaking world to accept spellings like 
maak, faam, which would be unphonetic to them. On the other 
hand, the distinction between the vowels of boot and book, food 
and good, pool and pull which is made throughout large areas of 
the English-speaking world would doubtless have to be shown in 
a reformed spelling, in spite of the fact that no such distinction 
is made in Scotland. 

697. So also terminations like -est, -less, -ness would have to 
be distinguished from -ist, -is, etc., in a reformed spelling of English. 
The former are pronounced -ist, -lis, -nis by many, including myself, 
but -ast, -las, -nas by others. I think, therefore, that the Simplified 
Spelling Society has been well advised to distinguish between the 
short i’s that have common variants with a and those which do 
not normally have such variants: they write puerist (for present 
spelling purist ), 11 tenis, ofis, etc., but puerest (for present spelling 
purest ), 11 goodnes , 11 hoeples , 11 etc. 

698. It might likewise prove advisable to make allowance 
for the widespread distinction of vowel between words like port, 
forth, course, hoarse, etc., and short, north, horse, etc., despite the 
fact that many do not observe the distinction in their speech. 
Such words could be written in an international orthography 
port, forO, kors, hors, and fort, nord, hors, etc. 

699. Striking compromise spellings would have to be adopted 
in a phonetic orthography for French. All possible "e mutes" 
would have to be written, although numbers of them are not 
sounded in Northern French. The distinction between a and a, 
so commonly made in the North of France, would probably have 
to be ignored; possibly also the distinction between a and oe. 
Distinctions in the length of e would probably have to be shown, 
though many French speakers do not observe them. And last, 
but by no means least, numerous unpronounced letters would have 
to be introduced, as indeed they are in present-day spelling, on 
account of the phenomenon known as “liaison” (see footnote 15 
to §693). 

700. In a phonetic orthography for German, p and b would 
both have to be used, despite the fact that many Germans do not 


17 International phonetic orthography of these words would be pyurist, 
pyurest, gudnes, houples. 
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differentiate the sounds. A phonetic orthography for Hindi 
would have to include the letters q, f, z, x and y, although a 
great many speakers of that language do not use the sounds 
which these letters denote; they substitute k, ph, j, kh and g for 
them. 18 And so on. 

701. When two pronunciations of a word enjoy wide currency 
and appear to have equal claims to representation in orthography, 
it may be found advisable to admit two spellings. Thus it seems 
reasonable in a phonetic orthography for English to admit two 
spellings of words like grass and path, viz. graas, paaO and gras, 
pad, since both pronunciations are in wide use. The admission 
of these spellings would obviate the need for English-speaking 
children to memorize lists of words spelt in a way that is unphonetic 
for them. The same applies to various isolated words, e.g. aider 
and iider, akomplif and aksmplif , 19 garraaj and garrej, agein 
and agen. Similarly in a phonetic orthography for French the 
present aout might appear as au or aut or w or ut, and the 
present but, obus might be spelt byt, obys, or by, oby 20 mauvais 
might be written either movez or movez , 21 mceurs might have 
two forms mcers and meet, and alcoolique the two forms alkooliko 
and alkolikd ; done might be written dorjk or dot ) 22 according to 
the pronunciation of the writer, and so on. In a phonetic 
spelling of Italian cosa might be written kosa and koza, since 
both pronunciations are common. 

702. Other departures from phonetic consistency are often 
desirable in an orthography to avoid cumbrous spellings or on 
other grounds of practical convenience. Thus it would probably 
be found advisable in a reformed spelling of English on international 
lines to write ju (or yu) to represent the ju: of such words as new, 
hue, in place of the cumbrous though more accurate juu (or yuu). 
This simplification would be advisable in spite of the fact that the 


11 See The Problem of a National Script for India, pp. 8, 9. 

*• The sound A is adequately represented by a in an orthography. 

,0 Or byt, obys, etc., if the system were one employing y and j with English 
values. 

11 See footnote 15 to §693. 

** With preference for dorjk which would differentiate it from the word 
dor) (gift). (Note that in a phonetic orthography for French it is convenient 
to use 7) as the sign of nasalization.) 
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short ju of 'monjumant, etc., is phonemically distinct from it. 
(The case is one where there is a phonemic distinction which does 
not happen to be used for differentiating any words; there is not 
to my knowledge any pair of words distinguished by ju: and ju in 
English.) In an international phonetic orthography for German it 
would no doubt be found advisable to denote most, if not all, 
strongly stressed long vowels b,y doubled letters, e.g. to write not 
only Boot and Moos as now, but also vool (for wohl), hoox (for 
hoch), groos (for gross), etc. But there are cases where it would 
be unnecessarily cumbersome to write oo. Notable are pretonic 
syllables, where a simplification to single o could be made 
provided that the few words containing the ordinary short o 
(a) in pretonic open syllables were listed as exceptions. Thus 
one could without inconvenience write Lokomotiive, kloroform, 
Honoraar, in place of the cumbrous though more accurate Lookoo- 
mootiive, etc., possibly also vomit instead of voomit (for the present 
womit), etc.; but it would be necessary to specify the few exceptional 
words in which the o would have the ordinary short sound, 
e.g. folenden (for vollenden), Koleek (for Kolleg), Korektuur (for 
Korrektur ). 23 

703. It is sometimes possible to depart from the strictly phonetic 
principle in orthography in order to distinguish homophones, i.e. 
words of identical pronunciation but different meanings. This is 
justifiable when there are suitable grounds for the spellings chosen, 
and when the spellings of both words are unambiguous. It might, 
for instance, be considered advisable in a phonetic orthography 
for German to write final b, d and g in place of p, t and k 
in those words where derivative forms have the sounds b, d 
and g. Such spellings would not be ambiguous, since the 
sounds b, d and g never occur finally. Thus there would be a 
certain advantage in continuing to distinguish the words at present 
written Rat and Rad (which are both pronounced ra:t) by using 
the forms Raat and Raad ; these spellings would show the relation¬ 
ships with the plurals, which would presumably be written in 


23 It would also be possible to effect an economy of letters in connected texts 
by representing strongly stressed long vowels by single letters before v and z 
since short vowels do not occur before these consonants in German. Thus 
there would be no ambiguity if leezen and Leave were simplified to lezen 
and Leve. A further economy might be effected by writing final long vowels, 
other than e, with single letters, e.g. ja, vi, vo, Zoofa (but Zee, Alee). 
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phonetic orthography Reete and Reeder. Another example is 
found in the Hindi words pronounced ke. J. R. Firth has suggested 
that it would be advantageous in a phonetic spelling for Hindi 
to distinguish the post-position from the conjunction by writing 
the latter as ky. u So also, if a phonetic orthography were ever 
adopted for French, it might be found desirable to distinguish 
the words for “steal” and “fly” which are homophones with most 
French speakers, 22 by writing them voler and voler in accordance 
with a distinction of pronunciation heard in French-speaking 
Switzerland. 

704. It may even at times be found advisable to introduce 
unpronounced letters into orthography in order to show relation¬ 
ships between words or to distinguish homophones. Thus, though 
strict phonetic orthography of such French words as the plurals 
haute, prSts would be hoz, prez, it would no doubt be preferable in a 
reformed spelling on international basis to insert t and write hotz, 
pretz, to show the relationship of these words to the singulars hot, 
pret and the feminines hots, hotez, preete, preetez. Incidentally 
pretz would thus be distinguished visually from the homophone 
prez (present spelling pres). 

705. So also it might be found advisable to introduce a few 
historical spellings into a reformed orthography for English in 
order to distinguish homophones. One might, for instance, distin¬ 
guish the two sen' s as sen and senn (present spelling son, sun) and 
the two eit’s as eit and eiht (present spelling ate, eight ). 26 

706. The examples given in the foregoing paragraphs (§§691-705) 
suffice to show how different an orthography must needs be from a 
transcription designed for teaching pronunciation. Nevertheless it 

,4 As Firth has pointed out, y is the appropriate letter for representing 
the Hindustani so-called "short i" in a romanic phonetic orthography. 
Since the normal short i-sound never occurs finally in that language, no 
ambiguity can arise from the occasional use of y to denote e in final position. 
A y in this position could not mean anything else but e to a Hindi speaker. 

25 Both pronounced vale in Paris, and both pronounced vole in various 
other parts of France. 

*• Another course would be to write ate as et in accordance with the common 
pronunciation, and ignore the variant eit. The idea of writing h in a reformed 
spelling of eight was suggested to me by Mr. William Barkley. (See his 
book. The Two Englishes, p. 37.) 
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is clear that if an orthography is to do its work with the maximum 
of efficiency, and be easy to learn, it should have the principle 
“one letter per phoneme" as its basis. It should in fact come as 
near to a phonetic transcription as the special requirements of 
orthography allow. 27 


CHAPTER XXXII 

THE PHONEME IN THE HISTORY OF LANGUAGES 

707. The study of the history of a language involves among 
many other things a study of what happens to its phonemes in the 
course of time. It is found that changes in phonemic structure 
frequently take place as languages develop, and that the changes 
are of many different types. 1 In the following paragraphs examples 
are given of twenty-nine types of phonemic development. There 
are doubtless others. 

708. (1) Phonemes remaining unchanged .—Often phonemes 
appear to have remained stationary for long periods, that is to say, 
as far as we can tell, both principal and subsidiary members (if 
any) have retained the same values during those periods. Such 
are in all probability the p, m and w phonemes of English from 

27 The reasons why a phonetic orthography has to differ from a phonetic 
transcription are further elaborated in my paper Dhe Fonetik Aspekt ov 
Speling Reform (published for the Simplified Spelling Society by Sir Isaac 
Pitman and Sons), and in my paper on Differences between Spoken and Written 
Language published by the Association Phonetique Internationale. 

1 When we speak of the sounds and phonemes of a language at some past 
date, we mean the sounds and phonemes which we think are likely to have 
been used by typical speakers of a particular region at that period. We 
cannot establish the pronunciation of bygone days with the precision attainable 
by direct observation of the speech of to-day, though we have reason to think 
that it is often possible to reconstruct old pronunciations with a very fair 
degree of accuracy. (See, for instance, the evidence quoted in my article 
on the pronunciation of Shakespeare’s day on pp. 38-41 of English Pronuncia¬ 
tion through the Centuries, published by the Linguaphone Institute.) It must 
be remembered too that there doubtless existed at all previous periods in 
the history of a language just as many different ways of pronouncing as 
there are now, if not more. 
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Old English times to the present day and the French i-phoneme 
during a similar period. 

709. (2) Change affecting an entire phoneme. —-Sometimes all 
the members of a phoneme have changed in value and still remained 
together as a phoneme. An example is the change of the Vulgar 
Latin u (derived from Classical Latin long u: 2 ) into y in French. 
The principal member of this phoneme was doubtless a close u 
identical with that of Modern Italian. Its representative in Modern 
French is a close y; example, Latin 'mu:ro (< Classical 'muirum) 
> French my:r. We cannot tell if there were any subsidiary 
members of the Vulgar Latin u differing appreciably from the 
principal u, but if there were, they have all been replaced by 
varieties of y in Modem French; thus the somewhat open y commonly 
used in French in non-final positions, as, for instance, in fy me (fumer), 
may well have come from a somewhat open u in the Vulgar Latin 
fu'maire (< Classical fu:'ma:re). 

710. This type of change must be distinguished from the case 
where one phoneme is substituted for another in the course of the 
development of a language. The latter is exemplified by the 
substitution of oe for a former a in the Parisian French pronunciation 
ol joli and absolu (33!i > 3oeli, apsoly > apsoely, footnote 3 to §306) 
and by the substitution of i: for e: in Early Modern English in the 
words which had had e: in Middle English, as in meat (me:t > mi:t), 
sea (se: > si:). (See §716, where it is shown that in Early Modern 
English e: emerged as a phoneme without any short vowel attached 
to it as a subsidiary member.) 

711. (3) Change in principal member, a subsidiary member 
being unchanged but remaining attached to the new principal member .— 
An example of this is seen in the development of the Latin b in 
Spanish. It is believed that, at one time, the Latin phoneme 
represented by the letter b had ordinary voiced b-sounds as its 
principal and subsidiary members similar to those found in the 
Modem French b-phoneme. This developed in Spanish into a 
phoneme having the fricative sound (3 as its principal member 
and plosive b as a subsidiary member, the latter being used when 


a It cannot be doubted that in late Vulgar Latin the Classical Latin short u 
acquired O-quality as in Modern Italian, and joined the phoneme containing 
the long close o. 
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m precedes. Example: Latin 'bibere > Spanish [(3e'Per], but 
Latin im'bibere > Spanish [embe'Per], 

712. (4) Change in a subsidiary member of a phoneme with 
continued attachment to an unaltered principal member. —This case 
is one which may be expected to arise fairly frequently, but it is 
difficult to find examples of which we can be certain on account of 
lack of evidence as to the values of subsidiary members in times 
past. A possible illustration is the development which has led to 
the use of ? in place of a previous t following a strongly stressed 
vowel in many types of modern English. Instances are seen in rai?, 
'li?l, 'mA?n which are common modern pronunciations of right, little, 
mutton. It is commonly supposed that the use of ? in such words 
is a modern innovation, and that formerly the words were always 
said with t . 3 

713. Possibly the use by some English speakers of ai as a variant 
of ai in unstressed syllables is another case in point. 4 It is so if 
a sound of the ai-type was at one time used in all positions and 
subsequently the weakly stressed ai’s were altered to ai. It appears, 
however, impossible to prove that this happened. 6 

714. (5) Change in a principal member together with development 
of a new subsidiary member. —The apparently recent development 


* The example is not one of which we can be very certain for two reasons. 
Firstly, it has not yet been definitely established that t and ? belong to the 
same phoneme in the speech of any of those who pronounce in this way. 
The problem is complicated by the fact that many of those who use ? in 
these words are erratic in their speech; also by the fact that ? is also used 
by some where the older pronunciation is supposed to have had k or p (e.g. 
'bei?a for baker, 'pi:?l for people). (See §620.) 

Secondly, it is not impossible that this use of ? is of some antiquity. It 
is difficult otherwise to explain its wide distribution in different parts of Great 
Britain and in America. The fact that words of the above types have been 
regularly written with t from the earliest times does not tell us much. For 
if the sound ? had been in use in (say) Old English, it could hardly have 
been denoted otherwise than by t (or perhaps k) when people first attempted 
to write English with the Latin alphabet. 

4 This is the pronunciation represented in Sweet’s Primer of Spoken 
English, where he wrote, for instance, ai in ties and besides, but ai in 
sympathize and idea. 

5 Another possible development is that all the Middle English i:’s changed 
to a sound of the ai-type as explained in §718, and that then the stressed ones 
developed still further into varieties of ai. 
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of the phoneme of London dialect corresponding to “average” 
English long u: may be an example of this. The principal member 
of this dialectal phoneme is su; it is heard in such words as [t s su] 
{two), [maun] {moon). There is a subsidiary member of the o- 
type used before "dark” I, as in [sk<?:l] {school), [roil] {rule), etc. 

715. That this is an example of the case under consideration 
rests on the assumption that these London dialect sounds are 
derived from a single sound identical with, or at least not unlike 
the “average” English u:. It is difficult to prove this definitely 
to be a fact, but it is perhaps not too much to assume that “average” 
English is on the whole more conservative than particular local 
dialects, and that for that reason the assumption has some 
justification. 

716. (6) Change in a principal member resulting in its becoming 
attached to another phoneme .—Instances of this have been frequent 
in English on account of the fact that several short vowels have, 
in all probability, undergone little or no alteration for hundreds 
of years, while the long vowels have all changed greatly since the 
Middle English period. The history of the vowels in such words 
as (1) rest, yet, (2) meat, sea, (3) make, face provides two illustrations. 
It is believed that the vowel of such words as rest, yet, has been 
(on an average) 6 a variety of £ for the last thousand years. As to 
the other vowels there is evidence to show that in Middle English 
the first was a long e: and the second a very front long a:, and that 
both these gradually changed to "closer” sounds, reaching, by the 
sixteenth century, values in the neighbourhood of cardinal e: and 
e: respectively. At a certain point during this development the 
sound representing Middle English long e: must have become 
detached from the phoneme containing short e, while the new 
long e: emerging from the earlier a: must have become attached to 
it instead. This transference may be presumed to have taken 
place at the time when the quality of the new e: had nearly reached 
that of the short £, and when the previous e: would have progressed 


* Several varieties of vowel are heard in such words at the present day, 
the .varieties being both individual and local. Doubtless such varieties have 
always existed. In fact their existence in early times is attested by variant 
spellings. We can only concern ourselves with what may be considered to 
have been an average pronunciation, namely, a sound not far from cardinal E. 
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some distance in the direction of close e:. This would presumably 
have been in the late fifteenth or early sixteenth century. 



Fig. 12. Diagram showing the Development of Middle English e: and a:. 


717. (7) Change in a principal member of a phoneme resulting 
in the appearance of a new phoneme unattached to any member of 
previously existing phonemes. —The development of Middle English 
e: referred to in the last paragraph furnishes an example of this. 
When this sound was replaced by closer ones (i.e. varieties of e:) 
and became detached from the phoneme containing short e, it 
evidently did not attach itself to any other short vowel of the 
language. (The only other short vowel near to it in quality was i, 
and in the sixteenth century this sound would naturally have 
been in the same phoneme with the long i: of such words as see, 
meet, which by that time had replaced the Middle English e:.) 

718. Another example found at the same period was no doubt 
the diphthong si, 7 which developed from Middle English i:, and 
must have become detached from the phoneme containing short i, 
to which its Middle English predecessor (i:) had belonged, without 
becoming associated into the same phoneme as any other sound 
existing at the time. Thus, in Middle English, the vowels in 
'mills (mile) and milk would have been grouped together in the 
same phoneme. But when i: had developed sufficiently in the 

7 A diphthongal sound intermediate between i: and ai which was probably 
in use in the sixteenth and seventeenth centuries. See my article on Shakes¬ 
peare in the Original Pronunciation, in English Pronunciation through the 
Centuries, pp. 36-45. 
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direction of si, it must have become separated from the short i 
and have constituted a separate phoneme, as it has continued to 
be in subsequent developments. 

719. Yet another example is found in the modern diphthongiza- 
tion of the e: which was probably in use in the sixteenth and 
seventeenth centimes in such words as make, table. If at one time 
the diphthongization was but slight, the diphthong (ei) might 
still have counted as belonging to the same phoneme as short e. 
In fact, in some forms of present-day speech, where the diphthongiza¬ 
tion of ei is not wide, a case can be made out for grouping it into 
the same phoneme as short e. But in the types of pronunciation 
where the development has gone further, and the diphthong is a 
wide one, this cannot well be done, since the sounds can hardly 
be said to be sufficiently nearly related. 

720. (8) Change in a subsidiary member resulting in its becoming 
attached to another phoneme .—Developments of this type are fairly 
common. The i and u phonemes of Classical Latin provide an 
example. The Classical Latin i-phoneme evidently comprised two 
members, a closer vowel when long and an opener one when short. 
The opener one must have been at one time sufficiently near in 
quality to the close one to justify using the same letter i to represent 
both. We may consider the quality used when the vowel was long 
as being the principal member of this phoneme. The subsequent 
history of the words containing these sounds shows us that in 
Vulgar Latin (a language which no doubt had considerable resem¬ 
blance to Modern Italian) the previous short i opened further and 
became similar in quality to the previous long e:, while length 
ceased to have significant function, and the Italian system of 
lengthening strongly stressed vowels in open syllables came into 
being. Thus arose developments like Classical Latin 'nivem > 
Vulgar Latin ‘ne:ve, Classical Latin 1 ubi > Vulgar Latin 'o:ve. The 
Classical Latin short i and u thus developed in such a way that 
they left the i and u phonemes and entered the e and o phonemes. 8 

721. Incidentally, it is possible that in Classical Latin the 
short e-sound, which doubtless was pronounced as a variety of e, 

8 It is noteworthy in this connexion that Classical Latin long u: became y 
in French (nuida > ny, etc.), while Classical Latin short u generally remained 
u in French (ursum > urs, gustum > gu, etc.). 
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would have belonged to the same phoneme as the long e:. 9 But 
when the usage in regard to length changed, as mentioned in the 
preceding paragraph, the short e became long in many words (e.g. 
Vulgar Latin and Italian 'pjmra, from Classical Latin 'pstra) and 
must therefore have separated itself from the phoneme used in the 
words which contained e: in Classical Latin. 

722. Another example is found in the Latin a-phoneme which 
is represented in French by a and £ and a according to circumstances: 
Vulgar Latin 'parte, sal, arna'mento > French pa:r (part), sel (sel), 
arnama (ornement). There clearly must have been a time when 
the Latin a-phoneme had a member tending towards e, which 
eventually joined the French £-phoneme derived from other sources. 
(For a > a, see §725.) 

723. A further example is offered by the Old English E-phoneme, 
which no doubt at one time had as a member a very open variety 
of £ used before j, which eventually joined the a-phoneme. Example: 
hejt(a) (written in Old English heorte), which subsequently became 
haJt and in modern times ha:t. 

724. Again, the first part of the development by which Latin 
intervocalic t eventually disappeared in French was no doubt a 
stage in which it was replaced by d. * Thus 'viita would in all 
probability have first become 'vi:da, a change from one phoneme 
to another, before becoming 'vi:3a (and eventually 'vi:a and vi). 

725. (9) Change in a subsidiary member resulting in its becoming 
a new separate phoneme. —This case is illustrated by the emergence 
of French s from Latin a. No doubt in early Vulgar Latin weakly 
stressed a had much the same value as strongly stressed a, as in 
Modern Italian. But there must have been a time when this 
phoneme as used in Gaul developed an obscured a as a member 
employed in weakly stressed positions. Eventually the French 
sound a emerged from this. Thus Vulgar Latin orna'mento 
became in French ornama. The appearance of a as a separate 
phoneme would appear to date from the time when the vowel 
in such words began to be written with e, which coincides with 

* This is on the assumption that Classical Latin ae was pronounced ae. 
It is possible, however, that Classical Latin ae was pronounced £: from earliest 
times, as it doubtless was in Vulgar Latin. If this were the case, the short E 
would naturally have belonged to the same phoneme as the £1. 
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the period when other unstressed Latin vowels became obscure, 
i.e. about the tenth or eleventh century. 10 

726. (10) Appearance of a new phoneme by coalescence of two 
sounds. —In some forms of English sequences of vowel + j have 
coalesced into single sounds, namely, “j-coloured” vowels. Such 
a sound is (j-coloured a) which may be heard from many South- 
Western dialect speakers in words like ha^t, 'ga^dn {heart, garden). 11 

727. (1.1) Change in a subsidiary member, with the result that 
it coalesced with an adjacent sound to form a new phoneme. —This is 
exemplified in the change I > u in Old French. It is clear that at 
an early period the l-phoneme had as a subsidiary member a "dark” 

I (+) used before consonants. Statements to this effect were made 
by Latin grammarians, and Early French spellings with I (such as 
altre, chevals, chevels) indicate that a l-like sound was probably 
still used before consonants. The t must ultimately have been 
replaced by the acoustically similar sound u, and, as there is no 
reason to think that the syllables consisting of vowel + 1 ever 
became disyllabic, this u must have been non-syllabic and must 
immediately have combined with the preceding vowel to form a 
diphthong—a new phoneme. Thus altra (written altre), which 
came from Vulgar Latin 'altero, would very easily have become 
autra (written autre). This in turn eventually developed into the 
present-day o:tr(a), after no doubt passing through some such 
intermediate stage as 'aotra. The change from 1 to u in such 
words appears to have begun not later than the tenth century 
and to have been complete by the twelfth century. 

728. (12) Change in the conditions under which a sound is used 
involving separation from the phoneme to which it formerly belonged .— 
Such separation may take place as the result of the elision of a 
“determining” sound. 12 It may also take place as the result of 
lengthening sounds previously short or shortening sounds previously 

10 See Nyrop, Grammaire Historique de la Langue Franfaise, Vol. I, §§162, 
253, 257 and elsewhere. 

11 For the occurrence of such sounds in American English, see L, Sprague 
de Camp on American Retroflexed Vowels, in Le Maitre Phonetique, July, 1942, 
p. 22. I am disposed to think that the American syllabic j (which = 
should be regarded as a member of the r-phoneme, differing from the American 
consonantal r mainly by its length (see §750). 

14 A sound which determines what member of a neighbouring phoneme in 
a sequence is to be used. 
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long, and doubtless also in other circumstances. The following 
are examples of the first of these cases. An example of the second 
is given in §734. 

729. In Early French the nasalized vowels were evidently 
members of the ordinary vowel phonemes, viz. the members used 
when nasal consonants followed. Thus, since the French b5 (bon) 
is derived from the Latin ‘bonum, which gave rise to 'bo:no in 
one type of Vulgar Latin, it is clear that the nasality of the vowel 
must have been brought about by the influence of the following n. 
The complete development was no doubt of the type 'bonum > 
'bo:no > 'b5:no > bon > bo. Similarly with m, as in 'lampa > 
'lampa > la:p ( latnpe ). It was during the last stage of these develop¬ 
ments that the nasalized vowels must have changed their phonetic 
status. As long as the nasal consonant was present these vowels 
would naturally have belonged to the ordinary vowel phonemes, 
but as soon as the nasal consonants disappeared, they must have 
become separate phonemes as they are to-day in Northern French. 
It is probable that this was a comparatively recent occurrence. 

730. The Old English f-phoneme furnishes another example of 
the conversion of a subsidiary member into a separate phoneme 
through elision of a determining sound. The principal member 
of this phoneme was f, 13 but there was evidently a subsidiary 
member v used between voiced sounds. Thus feon (now far) and 
hlaf (now loaf) were no doubt pronounced fsj, 4a:f, but it is clear 
that drifan (now drive) must have been pronounced 'driivan or 
'driivan. When the final a of words like drive, mile ceased to be 
sounded—an elision which apparently began to take place in the 
fourteenth century 14 —v must have become a phoneme separate from 
f, since the elision brought v into final position in many words: 
Middle English 'driiva became dri:v, 15 and so on. 

731. The Modern English g-phoneme probably originated in 
a similar manner. There is reason to think that in Old English g 
was a member of the n-phoneme and was used to the exclusion 
of n before k and g. The analogy of mb and nd makes it probable 


18 North of the Thames. South of the Thames there was as a rule no 
f-sound; V was probably used in all positions, except when immediately 
preceding breathed consonants. 

11 In Southern England; much earlier in the North. 

18 Later draiv and draiv. 
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that the written ng of Old English was pronounced gg, as it still 
is in the speech of many in the Midlands, and that the g of final 
-gg was dropped at the same period as the b of final -mb. Spellings 
indicate that this disappearance of b began in the fourteenth 
century. It would doubtless therefore have been about this date 
that g became a phoneme separate from n. 

732. An interesting example of the appearance of a new vowel- 
phoneme consequent upon the elision of a determining sound is 
afforded by a form developed from the earlier English sequence 
riu in the speech of some Americans. 18 In most types of English 
the i of this sequence has been elided and the modern pronunciation 
is simply ru(:). If, as is likely, the older sequence had a forward u 
[ii] as is the case with many modern speakers in such words as 
new, music (§89), this [ii] has generally reverted to the principal 
member of the phoneme; I believe most people pronounce brew, 
rule, etc., with the same shade of vowel as too, school. In some 
forms of American English, however, the forward u has been 
retained after the disappearance of the preceding i or j. This 
type of English consequently has a phoneme ii separate from 
the u-phoneme. Brewed and brood are with them distinguished 
as briid, brud. 

733. The following example from Russian is of a similar nature. 
Some Russians use pronunciations like aii, aii in place of the normal 
sequence aju [aju, aju], They pronounce, for instance, znaiit, 
'slujaii, in place of the normal 'znajut, 'slujaju ['znajiit, 'slujaju] 
(3HaioT, they know; cjiymaio, I am listening). In this pronuncia¬ 
tion ii constitutes a phoneme separate from u, since the sequence 
au also exists in Russian. Therefore if, as may be supposed, the 
pronunciations au, aii are developments of older pronunciations 
aju, aju, these reductions exemplify the appearance of a new 
phoneme through the elision of j. 17 

16 That the sequence riu was in use in the seventeenth century was indicated 
by John Wallis, who in his Grammatica Linguee Anglicance quoted the 
word brew as an example of iu (p. 12 of the first edition, 1653). 

17 Those who favour a mentalistic view of the phoneme will probably not 
agree with the proposition that ii is a separate phoneme from u in the pro¬ 
nunciation of these speakers. They would, I believe, regard au and au as 
“realizations of the phonemic sequence aju.” Trubetzkoy, whose view was 
stated to be not “psychological” (see §§655 656), used a similar expression 
when mentioning this case (Grundziige der Phonologie, p. 56, Cantineau’s 
translation, p. 65). 
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734. Examples of lengthening causing separation of sounds 
from phonemes to which they are believed to have been formerly 
attached are seen in the form of modern Southern English in which 
the traditionally short vowels are lengthened. As has been pointed 
out in §519 and elsewhere, there are now many Southern English 
speakers who pronounce words like this, box and book with long 
vowels (3i:s, bn:ks, bo:k). The length relationship between these 
vowel qualities and those of such words as pi:s (piece), to:ks (talks) 
and bu:t (boot) is no longer maintained, so that the sounds i, d and o 
have in the speech of those who pronounce in this way become 
phonemes separate from the closer i, o and u of these latter words. 

735. (13) Appearance of a new phoneme through the influence 
of words borrowed from a foreign language. —There are two cases of 
this: (a) when a foreign sound is imported into the language, and 
(b) when the introduction of foreign words causes two members 
of a phoneme to become separate phonemes. 

736. Examples of (a) are the English diphthong oi (imported 
from French), the sound h in Norman French (imported from 
Germanic), the Hindi q, f, z, x, y (imported from Persian 
and Arabic). 

737. An example which can be seen in progress at the present 
day is the importation of a b-phoneme into Kikuyu. When the 
English word book is used in Kikuyu, it is said in the most natural 
speech with the Kikuyu phoneme mb, thus mbu:ku [- ■]. It seems, 
however, that many Kikuyu speakers give it an English pronunci¬ 
ation bu:ku with a sound b differing both from the Kikuyu mb 
and from the normal Kikuyu initial b (which = |J). 

738. An example of (b) is said to be observable in the pronuncia¬ 
tion of some Japanese who employ <{> before vowels other than u 
in borrowed European words. They use, for instance, $irumu 
as the equivalent of the English film, when the more natural 
Japanese pronunciation would be hwirumu [<(>uirumu]. 18 In 
ordinary Japanese <{> belongs to the h-phoneme, as shown in §77, 
but in the pronunciation of these speakers <|> constitutes a phoneme 
separate from h. 

739. The emergence of v as a separate phoneme in English, 
though primarily the result of elision of a determining sound, as 


*• Palmer, Principles of Romanization, pp. 87, 88. 
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explained in §730, may have been aided by the influx of French 
words beginning with v, such as voice, veal, which many people 
would doubtless have endeavoured to pronounce in the French way. 

740. (14) Appearance of a new phoneme through contraction .— 
Examples are the French j and w phonemes, which had their 
origin in syllabic i (or e) and u (or o). Thus the j of Ij5 [lion) comes 
from a syllabic e (Latin le'o:ne(m) 19 ), and the w of wi (oui) from 
a syllabic o (Old French o + H). Prior to these contractions it 
would seem that there was no j or w in French, the Latin j and 
w having both become fricative, the j changing towards 3 and z, 
and the w changing to v. 

741. (15) Disappearance of a phoneme. —Sometimes a phoneme 
disappears completely from a language. One of three things happens 
in this case: either (a) the sounds of the phoneme, or some of them, 
are replaced by other sounds already existing in the language, or 
(b) the sounds disappear leaving no trace, or (c) the disappearance 
of the sounds is compensated for by length or some other attribute. 

742. (a) is illustrated by the treatment of the sixteenth-century 
English e: subsequent to its development from Middle English e: 
referred to in §716. In the fifteenth or sixteenth century the sound 
appears to have become a phoneme separate from short e, as we 
have already seen; but shortly afterwards, probably in the course 
of the seventeenth and the beginning of the eighteenth century, 
people ceased using it altogether and substituted the long member 
of the i-phoneme of that period for every e:. 20 (We cannot assign 
any reason for this substitution. It would seem to have been 
detrimental to the language, since several pairs of words previously 
differentiated came to be pronounced alike: sea and meat came to 
be pronounced like see and meet.) 21 

12 Which appeared in Old French as leun (presumably pronounced le'on). 

20 According to Bishop Wilkins ( Essay towards a Real Character and a 
Philosophical Language, 1668, p. 363) the el was still in use in his day. He 
considered the sound to be the long of the vowel in bet, thereby (in our ter¬ 
minology) still attaching it to the same phoneme as the short E. 

21 There is no question of the e: becoming gradually closer and eventually 
reaching i:. The change from a fairly close e: to i: must have been a sudden 
one. since as there already was an i: in the language, it would not have been 
possible for an e: of more than a certain degree of closeness to exist simul¬ 
taneously with it. 
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743. Other illustrations of (a) are the substitution of I for the 
Old English 4, as in 4a:f > Middle English b:f > Modern English 
louf (loaf); the substitution of j for X in Modern French; the sub¬ 
stitution of a for Portuguese non-nasal e in Brazilian 22 ; the 
substitution of the b-phoneme for the Latin w (or v) phoneme 
in Spanish. 23 

744. An example of (6) is the disappearance of the Early English 
x-phoneme which may have comprised two or more clearly distin¬ 
guishable members. In many words this consonant ceased to be 
sounded, and left no trace: such are Old English he:x (written 
heah) which became in Middle English simply hi: (written hy, etc.) 
and in later English hsi and hai (high), and Old English 'cuxte 
(ahte) which became in Middle English b:xta (or perhaps buxta) 
and eventually o:t (ought). 

745. Case (c) is dealt with in §§750 ff. 

746. (16) Disappearance of a subsidiary member of a phoneme .— 
Sometimes, while a phoneme continues to exist, one of its members 
disappears. The Early French d-phoneme affords an example of 
this. That this phoneme had d as principal member is evidenced 
by the fact that Latin initial d appears also as d in Modern French, 
as in 'durrum > 'du:ro > dy:r (dur). It is, however, fairly certain 
that the phoneme had a subsidiary member of the 9-type in inter¬ 
vocalic position. The evidence for this is (1) that Latin d in this 
position has disappeared in Modem French ('nu:da > 'ny:a > ny(:), 
etc.), and that a preliminary weakening of d to 0 would have 
naturally preceded this disappearance, (2) that the spelling dh is 
found in Old French texts in positions corresponding to Latin 
intervocalic d. The ultimate disappearance of this 0 is thus the 
disappearance of a subsidiary member of a phoneme. It seems 
to have begun in the speech of some people about the tenth century, 
and to have become general by the twelfth century. 

22 As in para (Portuguese ■pare, Brazilian 'para). The Portuguese nasalized 
£ has remained unaltered in Brazilian speech. Incidentally, the phonemic 
classification of the nasalized vowels in the Portuguese of Portugal and Brazil 
is somewhat obscure. (See Jones and Dahl, Fundamentos de Escritura 
Fonitica. p. 17.) 

28 This latter substitution presumably took place simultaneously with the 
change in the principal member of the b-phoneme from b to P (§711). 
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747. (17) Replacement of a sound by a sequence of two sounds — 
An example of this is the replacement of Latin short £ in stressed 
open syllables by je and is in French, as in 'febre(m) > fje:vr(a), 
'br£ve(m) > Old French brief (at one time no doubt pronounced 
brief, in one syllable). This development may have started with a 
narrow diphthongization of the type ee which was gradually 
widened to ie. It is perhaps impossible to discover at what date 
the diphthong became a sequence of two sounds. It may be 
remarked, however, that there would probably have been a long 
period during which it would have been possible to regard the 
ie either as a single diphthongal sound constituting a phoneme 
or as a sequence of two sounds belonging to two phonemes. It 
was no doubt the appearance of disyllabic ie from other sources, 
and particularly as the result of the elision of intervocalic d (3), 24 
which finally caused the diphthongal ie of such a word as brief to 
become disyllabic. As for the alternative form je, which is 
practically indistinguishable in sound from the rising diphthong 
ie, it may be presumed to have been thought of as a sequence of 
two sounds as soon as sequences like ja and jo appeared in the 
language 25 ; for the existence of such sequences would mean that j 
was definitely a phoneme of the language. 

748. Another replacement of a sound by a sequence of two 
sounds occurred during the development of Latin e: into French 
wa (me: > mwa, etc.). This development was presumably of the 
type e: > el > a! > oe > oe (which practically = we) > wa. Here 
again there must have been a period during which the sound could 
be regarded either as a diphthong or as a sequence of two sounds. 
The diphthong would doubtless have to be considered definitely 
as having become a sequence of two sounds at the time when w 
became established as a phoneme by the reduction of disyllabic 
o' i to the monosyllabic wi in the word oui (§740). 

749. Yet another example of the replacement of a sound by a 
sequence of two sounds is seen in the development which has given 
rise to the modern English ju:. In Old English the corresponding 

24 As in Old French oblider (probably pronounced obli'der) which has 
become in Modern French ublie ( oublier ). (The other modern form ublije 
is doubtless a quite recent development of ublie, formed on analogy with 
brije (briller), etc., after X had been replaced by j.) 

25 See §740. 
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sound is believed to have generally been a diphthong eu or in 
some words a diphthong eu: eu ( eow , you), neu ( neowe , niwe, newe, 
new), kneu ( cneow, knew), deu ( deaw , dew), feu ( feawe , few). It 
seems pretty certain that the first of these became iu at an early 
date, and that eu caught up with it later (perhaps not until the 
sixteenth or even the seventeenth century). 26 These were no 
doubt single diphthongal sounds until the modern pronunciation 
ju:, which can hardly be regarded as other than a sequence of two 
sounds, supervened. 27 

750. (18) Replacement of a sound-quality by a length. —Sometimes 
when a sound has been dropped compensation has been made by 
lengthening a neighbouring sound in the words affected. The 
disappearance of r before consonants in South Eastern English 
provides an example. Words like heart and purse doubtless had at 
one time (fifteenth and sixteenth centuries) pronunciations of the 
type hajt, pajs, 28 etc. The j’s in such words were doubtless 
retroflex continuants with vowel-colouring corresponding to the 
preceding vowel, these sounds being members of the r-phoneme. 
Subsequently in the Court language these j’s ceased to be sounded, 
and there was left in their stead a lengthening of the vowel; hence 
resulted the modern South-Eastern pronunciations ha:t, pa:s, etc. 
A second chroneme thus came into being, affecting the sounds a 
and a which previously had been subject only to one chroneme 
(short). 

751. The different development of words like purse in South 
Western English and in American English illustrates the same 
process. There it was the a which disappeared, while the j remained 

28 John Wallis in his Grammatica Linguae Anglicanae (1653), p. 63, said 
that few, beauty were pronounced more correctly with " e clarum” and w,- 
but that some people pronounced “ more sharply ” as if the words were written 
fiw, biwty. 

27 This may have been in the seventeenth century. The question is dis¬ 
cussed in Vietor’s Shakespeare's Pronunciation, Vol. I, Chapter V. 

28 More precisely pyjs. The vowel in this word was no doubt derived 
from a previous short u. Most of the short u's of Middle English are repre¬ 
sented in Modern English by A, having presumably undergone a development 
of the type u > Y (a variety of a) > A: see my article on Shakespeare in the 
Original Pronunciation in English Pronunciation through the Centuries, p. 39, 
§1 (Linguaphone Institute). But when j followed, this development would 
seem to have been arrested (in South Eastern English) at the time of the 
disappearance of the j. 
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and received a compensatory lengthening resulting in pronunciations 
of the type pj:s. It will be observed that in neither of these develop¬ 
ments of purse was there any change of phoneme. 29 It was the 
manner of using the phonemes in particular words that altered. 

752. Another example of the appearance of a new chroneme 
by compensation for the loss of a sound is seen in French words 
like m£:tr (maitre), am (ane) which replace earlier forms msstra 
(< Latin ma'gistrum), asna (< Latin ’asinum). 30 

753. Yet another example is afforded by the treatment of 
French “feminine e” in some parts of France and in Belgium and 
in French-speaking Switzerland. In these localities various 
masculine and feminine words are distinguished by vowel-length: 
bout and boue are pronounced bu and bu: respectively, ceil has a 
short vowel (oej) and feuille a long one (foe:j), and the masculine and 
feminine forms of such an adjective as original are distinguished as 
ordinal ( original) and orijina:! ( originale ). S1 These lengthenings of 
vowels in feminine words can hardly be other than substitutions 
for a formerly pronounced s, bu: replacing an earlier bua, 32 ori 3 ina:l 
replacing an earlier ordinals, and so on. The development is 
presumably a comparatively recent one. 

754. Sometimes a sound has disappeared from a sequence 
leaving compensation in the shape of syllabic character in another 
sound of the sequence. This is a special case of a substitution 
of a length for a sound-quality, since in order to be syllabic a sound 


28 Except that subsequently the American J has acquired a quality differing 
somewhat from the South Western English J. As far as I am able to judge, 
the syllabic South Western and American j’s still belong to the same phoneme 
as the consonantal r; its use appears to me to be comparable to the use of 
syllabic and non-syllabic I. 

30 It is believed that this disappearance of S was complete by the fourteenth 
century (Bonnard and Salmon, Grammaire Sommaire de VAncien Franfais, 
p. 65). 

81 These distinctions are not made in ordinary Parisian speech. In that 
type of French all final vowels are short, boue being pronounced bu, like 
bout] and with most speakers final j is accompanied by lengthening of the 
vowel, ceil being pronounced ce:j, and the feminine termination -ale is pro¬ 
nounced with a short vowel like the masculine -al (-al). 

82 Earlier still probably bas. (See Nyrop, Grammaire Historique de la 
Langue Franfaise, Vol. I, §189, where djDS is given as a probable early 
form of joue.) 
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must be comparatively long. Examples are seen in such English 
words as mutton and sudden which are now 'rriAtn, 'sAdn with, 
syllabic n, but which must be presumed to have been said with a 
vowel between t or d and the n when the words were first taken 
into the language from French. 

755. (19) Appearance of a new phoneme through a change of 
length. —As has been pointed out in §§510 ff. and elsewhere, minimal 
distinctions are often effected by complexes of quality and duration. 
If in the course of time changes of length occur in such cases, the 
distinction comes to be by quality only, so that new phonemes 
appear. One case of this has already been quoted in §734. Another 
is the shortening of the traditionally long and close i in the Scottish 
pronunciation of such words as week, keen, which has caused the 
presumably older short and open i (t) to become a separate phoneme 
in that form of English (see §520). 33 

756. (20) Replacement of a sound-quality by a tone. —It will no 
doubt be found on examination that sound-qualities have sometimes 
been replaced by tones. It seems likely, for instance, that Chinese 
dialects which at present do not make use of final p, t, k or ?, 
possessed sounds of this kind at an earlier period, and that sometimes 
when the sounds disappeared special tones took their place. It 
is probable too that the two tones of modern Panjabi are derived 
from aspirations in an earlier form of that language. (The low- 
rising tone, denoted in Grahame Bailey’s Panjabi Phonetic 
Reader by £ , corresponds to the aspirations of the consonants bh, 
dh, etc., of Urdu, while the high-falling tone A corresponds to 
the breathy voice often applied to Urdu vowels. 34 Compare 
Panjabi k { ar (house), bSot (much) with Urdu ghar, bsoht.) 

757. In languages making use of significant length or stress 
or voice-pitch one may expect these attributes to undergo modifica¬ 
tions in course of time. One may expect also that the modes of 

33 It will be seen from what has been said in this and previous paragraphs 
that new phonemes can come into a language in a variety of ways, the chief 
of which are (a) introduction of a sound from a foreign language, (6) isolative 
change resulting in detachment from an old phoneme (§§716, 717)., (c) elision 
of a “determining” sound (§§729-733), id) contraction (§740), [e) change of 
length (§§734, 755). There are also cases for which we are unable to account, 
such as the appearance of a in addition to a in modern Northern French. 

34 Represented in romanic writing by h following the vowel. 
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using them should sometimes change. The significant function 
of these attributes may be preserved in spite of such alterations, 
or on the other hand their significant function may disappear. 
Moreover, one of the attributes may come to have significant function 
in a language where it was previously not so used. Below are 
examples of some types of change in these attributes. There are 
doubtless many more. 

758. (21) Change in length not effecting significance. —Example: 
the (apparently recent) lengthening of French a before b, d and g, 
as in ta:bl, ba:g for the more usual and presumably older tabl, bag 
(table, bague). 

759. (22) The appearance of significant length where it did not 
exist previously.— Examples: the appearance of significant length 
affecting certain French vowels and exemplified in ms:tr ( maitre) 
and mEtr ( mettre ), and affecting the Scottish English close i and 
exemplified in the Scottish pronunciation of grid (greed) (with 
short close i) and A'griid (agreed). (See also the examples in 
§§750-753.) 

760. (23) The disappearance of significant length, as for instance 
in Vulgar Latin, 36 or in the English of those who pronounce such 
words as thick, stop, book with full length ( 0 uk, stmp, bo:k ). 37 

761. (24) Change in a type of stress, as for instance when a 
falling diphthong is replaced by a rising diphthong (see example 
in §748). 

762. (25) Shifting of the position of strong stress, as in many 
English words of French origin, e.g. courage, which no doubt first 
appeared in English in the form ku'raidja and has since become 
1 kAridj . 38 

763. (26) Changes in the values of tone in tone languages, such 
as those which must be presumed to have occurred in various 
Chinese languages, if they have a common origin. 

35 See §§377, 520. 

36 Length, which had been significant in Classical Latin, remained a note¬ 
worthy feature in Vulgar Latin, but it became linked with stress and was no 
longer significant. 

37 See §§130, 734,' footnote 8 to §513, footnote 1 to §592. 

33 Changes in the position of strong stress in English have generally been 
accompanied by changes of sound-quality. 



THE HISTORY OF LANGUAGES 251 

764. (27) Appearance of new tones and, disappearance of old 
ones in tone languages, as evidenced by the varying numbers of 
tones in present-day Chinese languages. 39 

765. (28) Substitution of strong stresses for special tones, such 
as that believed to have taken place in Greek. 

766. (29) Complete disappearance of significant tones from a 
language which was previously a tone language, without replacement 
by any other attribute, as is believed to have happened in 
Swahili. 

767. Other possibilities are the replacement of length by stress 
or by tone, the replacement of stress by length or by tone, the 
replacement of tone by length or by stress, and many changes 
affecting complexes of attributes. 

768. It should be added that though the attributes length, 
stress and voice-pitch can be and are employed for the purpose 
of distinguishing words, their use for this purpose is rather restricted. 

30 The number is said to vary from four to nine. (See B. Karlgren, 
La Phonologie Chinoise, Vol. I, Chapter XVI.) It should be noted, however, 
that Karlgren’s statement that Cantonese contains nine tones ( Phonologie 
Chinoise, p. 591) needs some amplification. If “tone” is used as a translation 
of the Cantonese terms _SCet) (tone of the upper series) and _ha (tone of the 
lower series), it means not only musical pitches but also what the Chinese 
call “entering tones,” i.e. musical pitches combined with a final stop consonant 
(unexploded p, t or k). In the classical style of Cantonese there are nine 
of these, six being musical pitches without final stop and three being musical 
pitches with final stop. They are: high fall (with variant high level in the 
same toneme), high rise, mid level, low fall (with variant very low level in 
the same toneme), low rise, low level, high with stop, mid with stop, low with 
stop. In colloquial Cantonese there are, in addition, a high rise with stop 
and a very high level (a separate toneme), making eleven in all. 

If one adopts what seems to me to be the better course of using the term 
“tone” in its common European sense of musical pitch only—the stops being 
taken for what they really.are, viz. articulated sounds—the numbers are: 
for the classical style of Cantonese six (high fall, high rise, mid level, low fall, 
low rise, low level) and for colloquial Cantonese seven (the same with the 
addition of the very high level). The “upper entering tone" (i.e. high pitch 
with stop) has the same pitch as the level variant of the high falling, the mid 
level with stop has the same pitch as the mid level, and the “lower entering 
tone” (i.e. low pitch with stop) has the same pitch as the low level. 

According to Karlgren ( Phonologie Chinoise, p. 594) there are, in other 
types of Chinese, cases where one tone has split into two and where two 
different tones have merged into one. 
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Very extensive use of them is, however, made for the purpose of 
superimposing meanings on sentences. The ways in which they 
are employed to fulfil functions of this type are very varied and 
complicated, and it is only to be expected that the methods of 
applying them in sentences should undergo considerable variations 
in course of time. 40 This subject does not, however, come within 
the scope of this book. 


769. I would conclude this chapter by pointing out that the 
investigation of phonemic changes occurring in the history of 
languages may prove to be of assistance towards determining the 
nature of the phoneme itself. It would be useful, for instance, 
to ascertain whether such developments as a change causing a 
sound to leave one phoneme and join another, or the substitution 
of a length for a sound-quality, are compatible with a mentalistic 
view of the phoneme. It is to be hoped that such questions will 
in due course receive the attention of specialists in the psychological 
aspects of phonetics. 


10 The following are two examples. French has evolved a special method 
of denoting intensity-emphasis by strong stress combined with a particular 
type of intonation (see Coustenoble and Armstrong, Studies in French 
Intonation, Chapters XIX, XX) and contrast-emphasis by another type of 
intonation (see Chapters XXIII, XXIV of the same book). Cantonese and 
Pekingese have different ways of giving prominence to a word in the sentence. 
In Cantonese it is done by giving extra length to the words, while in Pekingese 
it is done by making the non-prominent words "toneless" or nearly so. If, 
as is supposed, the two languages are derived from a common original, one 
or both of these methods of rendering words prominent must have come 
into existence by some sort of evolution. 
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APPENDIX I 

THE HISTORY AND MEANING OF THE TERM 
“PHONEME” 

1. The purpose of this Appendix is to give some account of the origin 
and use of the term "phoneme” as it has been employed in the Department 
of Phonetics at University College, London, since about 1916—the principle 
of which was passed on to me by pupils of Baudouin de Courtenay and 
collated with the work of Henry Sweet and Paul Passy, and made more 
specific in the Department itself as the result of applying the basic idea to 
a number of languages which had not previously been examined from this 
point of view. A few remarks on cognate matters are also included, and a 
suggestion is made for introducing a new term, which, if adopted, will go 
a long way towards bringing about uniformity of phonemic terminology in 
various countries. 

2. First it must be said that the idea of the phoneme, as distinct from 
the formulation of the theory, is very ancient. In fact by its nature it must 
date back to the times when people first bethought themselves of writing 
down languages by an alphabet instead of using a pictorial system. For 
people possess what the eminent American linguistician Edward Sapir 
(1884-1939) called "phonemic intuitions,” which come into action as soon 
as they begin attempting to write their own languages alphabetically. They 
work with phonemic intuition as long as they are phonetically untrained, 
and as long as they remain uninfluenced by alphabetic traditions (which 
always grow up sooner or later). They know by a sort of instinct which 
differences between speech-sounds are capable of distinguishing words 1 in 
their own languages, and as a rule they do not notice other phonetic differences 
which may exist but which are not capable of distinguishing words. In other 
terms, it is natural that in their early attempts at representing their languages 
by means of an alphabet men should write them phonemically. 

3. The fact is well demonstrated by the system by which the Korean 
language is written. It is alphabetic, and was devised by the Korean king 
Se-Jong about the year 1460 a.d. This monarch quite evidently possessed 
a feeling for the phonemes of the Korean language. To give but one instance, 
he represented the sounds p‘ (slightly aspirated p) and b by a single letter (H). 
He is believed to have been quite a good phonetician, and doubtless realized 
that the two sounds were different; but as the sounds only occur in Korean 
in complementary distribution (b occurring to the exclusion of p* between 


1 Sapir pointed out, after long experience of reducing American Indian 
languages to writing, that untrained native speakers generally have no diffi¬ 
culty in deciding what portions of connected speech constitute "words." See 
particularly his book Language, p. 36, footnote 6. 
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voiced sounds, and p‘ being used to the exclusion of b in other situations), 
they have a kind of "sameness" to Koreans. That "sameness” is phonemic: 
the two sounds belong to a single phoneme, and that is why it is proper to 
represent them in writing by a single letter.* 

4. The existence of "phonemic intuitions" was insisted upon repeatedly 
by Sapir. It came to his notice in the course of teaching American Indians 
to write their languages. He observed how they continually and naturally 
noted differences of sound which “mattered" (to them) and took no account 
of differences that did not "matter” (to them): to put the case in modem 
terminology, they recorded differences of sound which were phonemic but 
but ignored those which were not. Hence Sapir’s frequent use of such 
expressions as "psychological values" of sounds, the "inner” or “ideal" system 
of a language (as contrasted with its objective system of sounds),* "phonemic 
intuitions" and "the psychological difference between a sound and a 
phoneme.” 4 

6. People’s natural sense for phonemes is also observable when teaching 
children to spell. For instance, it does not occur to a child who speaks my 
kind of English that the I-sound in leaf ought to be written with a different 
letter from the I-sound infield. It can be shown to him (if the teacher thinks 
there is any point in doing so) that the sounds are not the same—one being 
a "clear I" and the other a "dark I,” but we speakers of Southern English 
feel them to be in some way "the same.” This feeling of "sameness” arises 
from the fact that the two sounds are in complementary distribution: "clear I” 
is used (to the exclusion of "dark I") before il and other vowels, whereas 
"dark I" is used (to the exclusion of “clear I”) before consonants and finally. 
The two sounds belong to a single phoneme, which means that the distinction 
just "doesn’t matter” as far as the child learning to read is concerned. 

6. Turning now to the question of formulating the theory of phonemes, 
it must be stated to begin with that the first linguistician to enter this field 
of enquiry and to point out the distinction between the "phone” (speech- 
sound, Russian zvuk) and the “phoneme” (Russian fonema) was almost 
certainly Jan Baudouin de Courtenay (1846-1929).* He was a philologist 
of Polish origin who established himself in Russia, first as a Privat docent at 


* The Korean language also possesses a completely unaspirated (sometimes 
glottalized) p and a strongly aspirated ph. These sounds belong to separate 
phonemes and are therefore properly written in the Korean alphabet with 
separate letters (HU and 31). 

* See Sapir, Language (1921), pp. 66-68. 

4 See Sapir, La Rialiti Psychologique des Phonemes (in the Journal de 
Psychologic, XXX, Nos. 1-4, Jan.-April, 1933, pp. 247-266). An English 
translation of this article is contained in the Selected Writings of Edward Sapir, 
edited by D. G. Mandelbaum (University of California Press, 1949). 

* Baudouin de Courtenay stated more than once that the word fonema 
was the invention of a student of his named Kruszewski. The formulation 
of the theory was, however, the work of Baudouin de Courtenay. 
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St. Petersburg, then as Professor for eight years (1876-83) at Kazan, where 
he created his famous school of linguistics. Later he held professorships at 
Dorpat (1883-93), Cracow (1893-1900) and eventually St. Petersburg (1901-18) 
where he continued to develop his school. He spent the last years of his 
life in Poland. He appears to have worked out the fundamental principle 
of the phoneme during the 1870's. He did not, however, write much on this 
theme, and in fact no clear exposition of it appeared in print until 1894 when 
he published his Pr<5ba Teorji alternacyj fonetycznych (Cracow). A German 
version of this, Versuch einer Theorie phonetischer Alternationen, was published 
at Strassburg in 1896. His teachings, however, eventually permeated into 
Western Europe, and especially to England, mainly orally through his pupils.* 
7. The idea of the phoneme was recognized independently, also in the 
1870’s, by the English pioneer Henry Sweet (1846-1912), though he did not 


• I am indebted to Dennis Ward, Head of the Department of Russian in 
the University of Edinburgh, for many of the above particulars. They are 
for the most part taken from the article on Baudouin de Courtenay in 
the B ol' shay a Sovetskaya Entsiklopediya (Large Soviet Encyclopedia), 2nd 
edition (1961), Vol. 6, p. 366. Ward has translated this article into English 
and has kindly given me a copy with permission to quote from it. 

The following is his translation of the paragraph in the article which deals 
with Baudouin de Courtenay’s work on the phoneme: 

“His chief merit in world science is the construction of the theory of 
phonemes and phonetic alternations. He had been working out this theory 
from 1868, thus forestalling Western European linguistics by nearly 40 years. 
He starts from the position that the role of sounds ‘in the mechanism of a 
language, for the feeling of the people’ does not coincide with their physical 
nature, and that this non-coincidence obliges one to distinguish ‘phonemes’ 
from ‘speech-sounds’; in his theory he subordinates the phonetic side of speech 
to the social function of language as a means of communication and a form 
of thinking. He states not only the mutual relationships of phonemes, but 
also the ways in which they are formed historically. A radical failing of this 
theory, as laid down by Baudouin de Courtenay in his basic works, is the 
psychological concept of the phoneme; nevertheless in one of his works 
(O nekotorych Otdelakh sravnitel’noi Grammatiki slovyanskikh Yazykov [Some 
Branches of the Comparative Grammar of the Slavonic Language], 1881) he 
showed the possibility of working out a theory of phonemes and phonetic 
alternations without recourse to any subjective idealistic premise.” 

Some further particulars concerning the early work of Baudouin de 
Courtenay are to be found in Z. Arend, Baudouin de Courtenay and the 
Phoneme Idea in he Maitre Phonitique, Jan., 1934, p. 2 ; J. R. Firth, The Word 
Phoneme in Le Maitre Phonitique, April, 1934, p. 44, and C. L. Wrenn’s 
article on Henry Sweet in the Transactions of the Philological Society, 1946 
p. 189 (footnote). To tell the full story (if it can be discovered at all) would 
require close reading of the works in Russian, Polish and the German men¬ 
tioned above, together with the Polish memoir of Baudouin de Courtenay 
by Ulaszyn (Poznad, 1934), and searching for other relevant material. 
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employ the term “phoneme." His realization of the principle was shown by 
the distinction he drew between "broad” and "narrow” types of phonetic 
transcription—a distinction that corresponded in one fundamental aspect to 
what we now call “phonemic" and "allophonic” transcriptions. 7 * * As C. L. 
Wrenn has said: "Sweet may virtually be regarded as a co-equal with 
BAUDOutN de Courtenay in discovering the phoneme. One cannot be sure 
whether Sweet or de Courtenay was the first to realize this new and most 
important conception, since they worked—the one in London and the other 
in Kazan in South Russia—in entire ignorance of each other’s studies."* 

8. Paul Passy (1869-1940) too, though he did not employ the term 
"phoneme,"* was aware at an early period of his career of the basic principle 
underlying the phoneme theory. I think he came to it independently, but 
he probably clarified his ideas on the subject in the 1880’s by studying the 
works of Sweet, for whom he had great respect. He is known to have had 
considerable correspondence with Baudouin de Courtenay from 1900 on¬ 
wards, but no records of this have been preserved, as far as I know. An 
early reference to the phonemic principle is to be found in the first formal 
statement of the aims of the International Phonetic Association (IPA) 10 which 
was drafted by Passy and was agreed to by the Council in 1888. The relevant 
paragraph ran as follows: "There should be a separate letter for each dis¬ 
tinctive sound; that is, for each sound which, being used instead of another, 
can change the meaning of a word.” 11 He repeated the principle subsequently 
on many occasions. The following is a characteristic quotation: "On a ddjk 
invents trop de signes pour des varies de sons n’ayant pas de valeur dis¬ 
tinctive ; c’est une manie funeste, qui risque de nous entratner de plus en 
plus loin—il n'y a pas de raison pour s'arreter—et qui finirait par rendre les 
textes phondtiques illisibles. Ne noter dans les textes que les differences 
significatives: c’est une rfegle d’or, dont on ne devrait jamais se ddpartir.” 1 * 

9. The phoneme idea was thus known in England, and to a very limited 
extent in France, 1 ’ at the time when I was first appointed to lecture on 


7 See Sweet’s Handbook of Phonetics (Oxford, 1877), especially pp. 103, 104. 
See also the article on Types of Phonetic Transcription which forms Appendix A 
to the 8th edition of my Outline of Rnglish Phonetics (Heffer, Cambridge, 1966), 
especially pp. 331, 332. 

* Trans. Phil. Soc., 1946, p. 189. 

* Except on one occasion, in Le Maitre Phonitique, Jan., 1931, p. 3, when 
he repeated "the golden rule of the practical phonetician” in these terms: 
“n’indiquer dans les textes que les differences distinctives, ou si l’on aime mieux 
ne represdnter par des lettres differences que les phonemes differents.” 

10 Then called the Phonetic Teachers' Association. 

11 The Phonetic Teacher, Aug., 1888. 

11 Le Maitre Phonitique, Oct., 1926, p. 29. 

18 French philologists had for a long time previously been using the term 
"phonfeme,” but always in the sense of “speech-sound” (synonymous with 
“son” or "son du langage”). The technical sense assigned to it by Baudouin 
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phonetics at University College, London (January, 1907). But it was some 
time before we in England realized the full import of the concept, and in 
spite of Passy's exhortations still more time elapsed before the theory came 
to be regarded as a basic feature in the teaching of phonetics here. 

10. The word “phoneme” in the sense attributed to it by Baudouin de 
Courtenay was first brought to my notice by L. SCerba (one of his pupils) 
in 1911, who referred to the concept in his pamphlet Court Exposi de la 
Prononciation Russe published by the IPA in that year. 14 About two years 
later the theory was explained to me more fully by another of Baudouin 
de Courtenay’s followers, Tytus Benni of Warsaw. The immense im¬ 
portance of the theory then became very clear to me, especially in its relation 
to the construction of phonetic transcriptions, to the devising of alphabets for 
languages hitherto unwritten or unsuitably written, and in general to the 
practical teaching of foreign spoken languages. Consequently by about 1916 
the theory began to find a regular place in the teaching given in the Department 
of Phonetics at University College. 

11. The term "phoneme,” though known to phoneticians in England round 
about 1916, did not become current in books for some little while. The theory 
was still in the process of being clarified, and the terminology was incomplete. 
I used to manage at that time without employing the term. For instance, 
I wrote on p. xiv of the Sechuana Reader by myself and S. T. Plaatje (Uni¬ 
versity of London Press, 1916): "The consonant sounds C, J, l|, and the vowel 


de Courtenay was almost unknown in France (except to Passy), and 
remained so until Andr£ Martinet came on the scene. The word "phoneme” 
is believed to have been invented (independently) with the meaning of 
"speech-sound” by Louis Havet, who used it in 1876, if not before. "Speech- 
sound” was the only meaning given to the term in Marouzeau’s Lexique de 
la Terminologie Linguistique (1933). The 2nd and 3rd editions of this work 
(1943 and 1961) were improved by the addition of a paragraph referring to 
what was described as an English use of “phonem” [sic.]. 

F. de Saussure, the Swiss pioneer in linguistics, likewise used the term 
"phoneme” regularly to mean "speech-sound.” It would seem, however, 
from a passage on p. 164 of his Cours de Linguistique Ginirale (first published 
in 1916) that he was beginning to get an inkling of the concept of the phoneme 
as we know it and also of the diaphone. But his explanation was obscure and 
of a negative character, and he never reached the point of recognizing the 
existence of allophones. 

14 His reference to the phoneme in that pamphlet was not made sufficiently 
explicit, and some confusion arose owing to two very unfortunate misprints 
in his sound chart. The symbols UJ and a were printed in black type (the type 
used to designate the phonemes) when they should have been in ordinary 
roman (to show that the sounds belonged to the same phonemes as i and a 
—facts which must have been known to SCerba as they were to Baudouin 
de Courtenay ; see R. Jakobson’s note on p. 102 of Trav. Cercle Ling, de 
Prague, II, 1929). 
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sound u are probably 16 ‘non-distinctive’ in respect to the sounds t, I, w and u. 
By this we mean that the substitution of the sounds t, I, W, U, respectively, 
for the sounds c, J, l), « would probably 16 never change the meaning of any 
word." A year or two later I would have said that the sounds c, J, l|, « 
"belong to” or “are members of” the t, I, W, U phonemes. 

12. At this point I must make mention of a decision that had to be taken 
about that time. The term “phoneme” as used by Baudouin de Courtenay 
was a phonetic one, 17 and I have never seen any reason to consider it other¬ 
wise. 18 A comparison between his work and that of Sweet and Passy showed 
that this phonetic concept can be viewed in two ways, the “psychological" 
and the “physical.” Viewed "psychologically” a phoneme is a speech-sound 18 
pictured in one’s mind and “aimed at” in the process of talking. The actual 
concrete sound (phone) employed in any particular speech-utterance may be 
the pictured sound or it may be another sound having some affinity to it. 
its use being conditioned by some feature or features of the phonetic context. 
This was the view taken by Baudouin de Courtenay and his immediate 
followers. Benni told me (about 1913) that they consequently recognized 
two kinds of phonetics: one was called by them “psychophonetics” and related 
to the pictured sounds ; the other was called “physiophonetics” and related to 
the concrete sounds actually uttered. Corresponding to these were two types 
of phonetic transcription: the “psychophonic” (representing only phonemes) 80 
and the "physiophonic” (representing sounds actually uttered). 81 

13. Viewed from the "physical” angle a phoneme is a family of uttered 
sounds (segmental elements of speech) in a particular language 88 which count 
for practical purposes as if they were one and the same ; the use of each 
member of the family (allophone) is conditioned by the phonetic environment, 
i.e. no one member ever occurs in the situation appropriate to another. 88 To 
use a modem technical term (invented, I believe, by M. Swadbsh), the 
members of the family are in “complementary distribution.” 81 We can see 


16 Further research showed that the word "probably” should have been 
deleted. 

16 The words “would probably” should have been deleted. 

17 It was stated by him to be so in his Prdba Teorji alternacyj fonetycznych 
and in his Versuch einer Theorie phonetischer Alternationen. 

18 The phonetic character of the phoneme was also recognized much later 
by the American linguistician Morris Swadesh, who wrote in 1934 “The 
description of the phoneme in terms of norm and deviation belongs to the 
science of phonetics” (Language, X, No. 2, p. 117). 

18 A “linear” or "segmental” element of speech. 

80 Now called “phonemic” or “linguistically broad” transcription. 

81 Now called “allophonic” or "linguistically narrow” transcription. 

88 “Language” here meaning one particular and consistent variety of a spoken 
language, i.e. what is now called an “idiolect.” 

88 E.g. in English the k-sound of call never occurs before an i; nor does 
the k-sound of king ever occur before 31. 

81 Not "free variation.” I find it necessary to postulate that a sound not 
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from the writings of Sweet and Passy that they would have subscribed to 
this view. Z. Arend, one of the followers of Baudouin de Courtenay, 
was also not averse to it. 

14. Both these ways of regarding the phoneme were thus in existence 
at the time to which I am referring (round about 1916). They are not incom¬ 
patible ; in fact they lead to the same practical results. Together they formed 
the foundation upon which a complete theory of the phoneme had to be built. 
Since little had been done at that time towards applying the theory to any 
language except Russian, it fell mainly to the members of the staff of the 
Department of Phonetics at University College, London (because no one else 
in Western Europe or America seemed to be interested at that time) to start 
the further development of the theory with the aid of materials obtained by 
examining the phonetic features of a number of other languages. 

16. Here I must interpolate a short explanation concerning the use of 
technical terms. I am of opinion, especially in view of what was said in 
§§ 2-6 above, that it is perfectly justifiable to take into account "mind,” 
"feeling," “impressions,” “notions,” "picturing” and other undefinable 
psychological terms in investigating the nature of the phoneme. I consider 
it justifiable too to postulate axioms (unproved assumptions). 16 Besides 
which, I hold that though observations capable of "exact” measurement 16 may 
have their uses, it would be erroneous to place reliance on them as sole 
admissible criteria in the determination of phonemes. 

16. When speaking or writing, all of us (including those who strive after 
"exactness”) make constant use of inexact, ambiguous and undefinable terms 
and axioms. This does not prevent our words from being very useful, and 
indeed indispensable, since people are endowed not only with reason but also 
with common sense (direct perception, intuition), and they rely to a large 
extent on these powers when listening or reading. We are accordingly 
generally able to make ourselves understood in spite of inexact wording and 
the use of undefinable terms. As L. R. Palmer puts it in his Introduction to 


belonging to a specified form of a language (idiolect) must be held to belong 
to another “language” (idiolect). Free variants must, in my opinion, be 
treated in separate categories ; they belong to what I have called "diaphones.” 
See Proceedings of the First International Congress of Phonetic Sciences (Amster¬ 
dam, 1932), p. 23 ; also the Explanations in the 1966 and subsequent editions 
of my English Pronouncing Dictionary, p. xxxv, and the Glossary of Phonetic 
Terms in the same work, p. 636; also Chap. XXVII of the present work. 

It should be noted that the diaphone is a much less definite concept than 
the phoneme, owing to the fact that a particular speech-sound often belongs 
to two or more diaphones. For instance, the sound AU belongs to the OU 
diaphone in types of London English, but to the au diaphone in Scottish 
English. 

16 H. J. Uldall's “primitive ideas.” See his Outline of Glossematics 
(Copenhagen, 1967), p. 36. 

16 In reality there appears to be no such thing as an exact measurement. 
All measurements are approximate. 
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Modern Linguistics p. 82: “speech is nothing more than a series of rough 
hints, which the hearer must interpret in order to arrive at the meaning 
which the speaker wishes to convey." 

17. For the above reasons I regard the psychological view of the phoneme 
as a tenable one. This is not, however, to say that the physical view is to 
be rejected. That view is equally tenable. In fact, when it became necessary 
for me to come to a decision between the two, I found it in the end impossible 
to escape the conclusion that the physical view of the phoneme is on the 
whole better suited to the needs of ordinary teaching of spoken languages 
and (in spite of Sapir’s experiences) for those who are called upon to reduce 
to writing languages hitherto unwritten or to improve upon existing unsatis¬ 
factory orthographies. I find the physical view more easily comprehensible 
to the ordinary student of languages than any other. At the same time I 
do not hesitate at times to resort to psychological criteria. 

18. Reverting now to the use of the word "phoneme,” I should like to 
put on record the fact that the first occasion on which I employed the term 
outside the class-room or in private conversation with my colleagues was in 
a lecture on The Sechuana Language given to the Philological Society on 
4th May, 1917. Unfortunately, the part of that lecture dealing with the 
general conception of the phoneme was omitted from the Proceedings. It 
may therefore be of some historical interest, as exmplifying the state of our 
knowledge of the phoneme at that date, if I reproduce here the words I used 
on that occasion. They are copied from the lectures notes which I happen 
to have kept. They include words and forms of expression which were shortly 
afterwards found to be inadequate; these are printed here in italics, and the 
wordings which would have been used later are put in footnotes: 

“It is rather difficult to say precisely what the number of speech-sounds 
is in Sechuana. It depends how you define a speech-sound. We know that 
in every language certain so-called speech-sounds vary in value to some extent 
according to their surroundings. M (Examples: English k in key, call, n in 
bun, month, e in get, well.) But in counting up the English sounds, we do not 
generally count these varieties as separate sounds; we ignore such incidental 
variations because they are not *• what is called "significant”; the meaning of 
words does not in any way depend upon them. 

"In order to make matters clear, it will be found convenient in dealing 
with any language to make a distinction between 'speech-sounds’ proper and 
what may be termed ‘phonemes' or the significant phonetic 30 elements of speech. 
Speech-sounds should be considered as definite sounds incapable of variation ; 


* 7 Published by Macmillan. 

a8 certain different though related sounds of a given language count as if they 
were one and the same, the use of one or another being prescribed by the phonetic 
context. In more modem terminology they are said to be in "complementary 
distribution” and are called "members of the phoneme” or "allophones.” 

*’ incidental use of variant sounds, because the differences between them are 
not . . . 

*° segmental or linear. 
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a phoneme might be defined as a normal sound of the language together 
with all its incidental variants. These two k’s in English [in key and call] are 
different sounds but the same phoneme. 11 

"In such languages as English and French, phonemes do not as a rule differ 
greatly from speech-sounds Incidental variations are on the whole slight. 
But in other languages cases may be found where very different speech-sounds 
count as being 83 in the same phoneme. Russian is, of course, the most notable 
instance of this kind of language; there each vowel phoneme, for instance, 
has a whole set of easily distinguishable values 4* depending on its situation 
in the word or sentence.” [Here followed two examples from Tswana 
(Sechuana): the I-phoneme, which has a member J (a phone sounding between 
I and d) used to the exclusion of ordinary I before i and U, and the u-phoneme 
which has as a member a very advanced variety "approaching French y" 
used exclusively when a following syllable contains i.] 

19. By 1918 the terminology was straightened out, as will be seen from 
the short but very nearly adequate explanation of the phoneme given in 
§§4-9 of the Introduction to the Colloquial Sinhalese Reader written in that 
year by the Sinhalese phonetician, psychologist and educationist H. S. Perera 
and myself and published early in 1919.“ As that book is now difficult to 
come by, I reproduce here the section relating to the phoneme; it is headed 
"Sounds and Phonemes””: 

"4. A speech-sound is a sound of definite acoustic quality produced by the 
organs of speech. A given speech-sound is incapable of variation. 

"6. Most languages contain a very large number of distinguishable speech- 
sounds. But fortunately it is not necessary in phonetic writing to have separate 
symbols for each sound, owing to the fact that many of the sounds fall into 
groups called phonemes. 

"6. A phoneme is defined as a group of related sounds of a given language 
which are so used in connected speech that no one of them ever occurs in 
positions which any other can occupy. 

"7. Thus the k's in the English words keep, call, are distinct speech-sounds 
but belong to the same phoneme (the English k-phoneme). This is because 
the first variety of k only occurs before the sound i:, and the second does 
not occur in that position in English. The two kinds of k can without 
ambiguity be written with the same letter (k) in phonetic writing. Likewise 
the n's in the Sinhalese words kanzda ("hill”), kanz8ia ("mound”), are different 
sounds but they belong to the same phoneme, viz. the Sinhalese n-phoneme. 
The first kind of n only occurs in Sinhalese before t and d ; the second only 
occurs before *t and 8.*‘ 


11 but they belong to the same phoneme. 

** do not as a rule comprise many members (allophones) and in only a few cases 
do they comprise members differing widely from the "norm" (principal member). 
** belong to. 

** members (allophones). 

** Manchester University Press. 

*• The symbols t and 8 (first proposed by the Rev. J. Knowles) were 
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“8. Speech-sounds which belong to the same phoneme cannot distinguish 
one word from another; failure to distinguish them on the part of a foreign 
learner may cause him to speak with a foreign accent, but it will not as a 
rule make his words unintelligible. On the other hand, if the foreign learner 
confuses one phoneme with another, he will confuse different words of the 
language. 

"9. It is generally only necessary in phonetic writing to have symbols for 
the phonemes. The use of the different sounds belonging to any given phoneme 
is, in most languages, determined by simple rules which can be stated once 
for all, and which can be taken for granted when reading phonetic texts.” 

20. The above explanations with a few verbal amendments to the definition 
(see below) hold good to-day, and the theory as it then stood has formed the 
basis of the phonetic work at University College ever since. 

21. Further developments in phonetic theory followed very shortly. One 
of the most noteworthy was initiated by the American phonetician D. M. 
Beach, who came to work in the Department in 1919. He continued to study 
and do research work there for nearly four years, gaining a Ph.D. for a 
remarkable thesis on the Phonetics of Pekingese . a7 

22. Beach was, I believe, the first to show that groupings analogous to 
those of phones into phonemes are to be found in connection with one of 
the “sound attributes.”* 8 One day—it was, as far as I remember, about 
February, 1921 —he gave a lecture in the Department of Phonetics at Uni¬ 
versity College, London, on the Phonetics of Pekingese, in the course of which 
he demonstrated that each of the four so-called “tones” of that language 
had "variants" conditioned by the tones of syllables adjoining them in 
connected speech, and sometimes by other factors. The word "toneme” was 
coined on that occasion at my suggestion: it was readily accepted by Beach 


employed to denote retroflex t and d, the IPA having at that time not come 
to a definite decision as to the mode of representing these sounds. The tradi¬ 
tional though unsatisfactory t and d had been used previously. The present 
IPA symbols (( and <|_) were decided upon in 1927. 

87 This thesis has never been published. It may be consulted in the Library 
of the University of London. 

Beach took his first degree at Harvard in 1915, majoring in chemistry. 
He then went to China, where he obtained an appointment as a teacher of 
chemistry, English and other subjects near Peking. He soon discovered 
through teaching English and learning Chinese, that he had talent for linguistic 
studies, and that in fact these studies had greater interest for him than 
chemistry. It was after four years in China, during which time he had attained 
an excellent knowledge of Pekingese, that he came to London to specialize 
in phonetics. In 1923 he was appointed Senior Lecturer in Phonetics in the 
University of Cape Town, and a few years later a professorship was conferred 
on him there. 

88 Now often called "suprasegmental" or “prosodic" features of pronunci¬ 
ation. 
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and the members of the staff of the Department at the time. 8 * The purpose 
of the introduction of the term was to be able to express the fact that the 
actual “tones” employed in the tone languages are groupable into families 
called "tonemes” in the same sort of way that "phones” are groupable into 
"phonemes." 40 

23. While Beach was still in London, he invented the terms "tonetic” 
(pertaining to the tones of tone languages), “tonetics” (the study of tones) 
and "tonetician" (a person specializing in this study). He employed these 
terms, as well as “toneme,” in his thesis on The Phonetics of Pekingese (1923), 
and in the article mentioned in footnote 39. In this article he also introduced 
the term “tonology” to denote "the comparative and historical study of tones." 

24. It is noteworthy that K. L. Pike re-invented the term "toneme” 
independently in the early 1940’s, giving to it the same meaning as Beach 
had done. 41 

26. It may be added here that Beach appears to have been one of the 
first (if not the first) to employ the term "prosodies” to denote what had 
previously been called “sound attributes,” the chief of which are length, 
stress and voice-pitch. See the section headed "Classification of Prosodical 
Elements” in his thesis on The Phonetics of Pekingese, pp. 76-91. 48 Others 
who subsequently employed the same term in the same sense were Trubetzkoy, 
R. Jakobson and K. L. Pike 43 ; later also J. R. Firth, who, however, used 
the word with extensions of meaning. 44 


88 The term "toneme” was subsequently employed again by Beach in his 
important article The Science of Tonetics and its application to Bantu Languages 
in Bantu Studies, Dec., 1924, pp. 76-106, and later in his book The Phonetics 
of the Hottentot Language (Heffer, Cambridge, 1938). Lilias E. Armstrong 
used it in the Burmese Phonetic Reader which she wrote in collaboration with 
Pe Maung Tin (University of London Press, 1925), and C. M. Doke used 
it in his book The Phonetics of Zulu (University of the Witwatersrand Press, 
1926). 

40 As Beach put it in his above mentioned article: "The key to all tonetic 
transcription is the principle of the toneme, just as the key to all phonetic 
transcription is the principle of the phoneme" ; and again on a later occasion 
(The Phonetics of the Hottentot Language, published by Heffer, Cambridge, 
in 1938, p. 127): "Just as phonetic transcription depends on the phoneme- 
principle, so tonetic transcription must depend on the toneme-principle.” 

41 See his Tone Languages (University of Michigan Publications, Linguistics, 
4, 1948). At the time when Pike re-invented the term “toneme” he had 
apparently not seen that M. Swadesh had already used the word in his 
important article The Phonemic Principle in Language, X, No. 2 (1934), p. 117. 

48 It is possible that Beach got the term "prosodic" from Sapir who used 
it (in the sense of "rhythmic”) in his Language (p. 187). 

48 The latter in his Phonemic Work Sheet (Summer Institute of Linguistics, 
Glendale, California, 1938). 

44 See Firth, Sounds and Prosodies in Trans. Phil. Soc., 1948, pp. 127-162 
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26. The theory of the phoneme and the toneme had thus become estab¬ 
lished on a firm foundation in England in the early 1920’s. A fairly adequate 
positive definition of the phoneme had been worked out on physical lines 
and appeared in print first in 1919, as stated above. It was repeated (with 
different examples) in 1923.“ Minor improvements in the wording of the 
same definition, to make it more precise, were made in my paper On Phonemes 
in Travaux du Cercle Linguistique de Prague, IV (1931), p. 74, and in my 
paper The Theory of Phonemes and its importance in Practical Linguistics 
published in the Proceedings of the First International Congress of Phonetic 
Sciences, Amsterdam, 1932, p. 23,“ and again in Chap. II of the present 
book,* 7 .* 8 

27. In more popular expositions of the phoneme idea I prefer to use the 
simpler description enunciated in § 13 (which means the same thing as the 
precise definition just given). 48 In any case, whether the phoneme is defined 
psychologically or physically, I adhere to Baudouin de Courtenay's state¬ 
ment that the term "phoneme” is a phonetic one. In fact, since I came 
to the phoneme entirely through phonetics, I cannot do otherwise than regard 
phonemics as an essential part of phonetics. We must not overlook the facts 
that phonetics can neither be studied nor applied without the use of phonetic 
transcriptions, and adequate systems of transcription cannot be constructed 
without the theory of phonemes. 80 


18 In the Pronunciation of Russian by M. V. Trofimov and myself (Cam¬ 
bridge University Press, 1923). This book was prepared during 1916-17, and 
the manuscript was sent to the printer on 31st December, 1917. Circumstances 
prevailing at the time delayed its appearance until 1923. 

“ I was pleased to see that the gradual improvement in the wording of the 
physical definition of the phoneme between 1923 and 1932 was noticed by 
W. Freeman Twaddell in his monograph On defining the Phoneme (Linguistic 
Society of America, 1936). (He did not accept the definition, but propounded 
another, taking the view that the phoneme should be regarded as an "abstrac- 
tional fictitious unit.”) 

47 First edition published by Heffer, Cambridge, 1960. 48 I started writing 

The Phoneme in 1937. The greater part of the manuscript was ready about 
1941, but the conditions prevailing at the time prevented me from completing it 
until the end of 1946. It was in the printer's hands in Jan., 1946. The neces¬ 
sary type-cutting, printing and proof-correcting took four years. 

48 See for instance the Glossary of Phonetic Terms in my English Pronouncing 
Dictionary, 11th edition (1966), p. 637, and subsequent editions. 

80 I am unable to subscribe to Trubetzkoy’s proposal to treat phonemics 
as a science on its own, quite separate from phonetics (Grundziige der Phono¬ 
logic, pp. 12-17, and Cantineau's translation, pp. 6-16), so that the phonetician 
is prohibited from concerning himself with meanings of words, while apparently 
the phonemicist need not trouble himself overmuch with the ways in which 
words are pronounced. Such a separation is, for me, impossible. The two 
subjects are, to my mind, part and parcel of a single science, which, ever 
since serious studies of them began, has been called "phonetics.” There is. 
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28. Some of those who interest themselves in linguistics may feel some 
surprise that the above definitions contain no reference to the function of 
phonemes in connexion with meanings. It is my considered opinion that any 
reference to meaning is out of place in a physical definition of the phoneme. 
It is incumbent on us to distinguish between what phonemes are and what 
they do. Phonemes are what is stated in the definition. What they do is to 
distinguish words from one another. Different sounds belonging to the same 
phoneme cannot do this. (See Chap. IV of the present book, §§ 48-53.) It 
follows also from the physical definition that phonemes are of necessity units 
of linguistic structure. 

29. During the 1930’s our American colleagues improved the terminology 
relating to the phoneme by introducing the term “allophone” to denote a 
member of a phoneme." Not that there is anything the matter with the 
term "member," but "allophone” has the advantage of providing us with 
a corresponding adjective “allophonic.” Se we can now speak of "allophonic 
transcription" (phonetic transcription in which special symbols are employed 
to denote allophones"), “the allophonic use of J in French,” etc. 

30. The above terminology being established in connexion with the 
phoneme, we can now in a similar manner improve upon Beach’s original 
terminology relating to tones by adding "tonemic” (pertaining to tonemes, 
as in "tonemic marking of tones”), "tonemics" (the study of the grouping 
of tones into tonemes), “allotones” (tones belonging to a particular toneme) 
and “allotonic” (as in "allotonic” or "narrow” representation of tones)." 

31. It eventually became evident that similar terminology is applicable 
to another of the sound attributes, namely length, and is of value in connexion 
with languages possessing significant degrees of length. If a particular degree 
of length is called a "chrone,” then we find that several chrones may be 
groupable into a single "chroneme” (a family consisting of two or more chrones 
in complementary distribution, and therefore counting as if they were the 
same, the differences of length being conditioned by the phonetic environment). 
For instance, in my sort of English, where the close i (as in head) may be 


however, no reason why specialized books should not be written on phonemics, 
as there are on intonation or other branches of phonetics. K. L. Pike has 
written an excellent one (Phonemics, University of Michigan Press, 1947). Inci¬ 
dentally, I am not sure that Pike did not go a little too far when he made 
the pronouncement that "phonetics gives us our raw material; phonemics 
cooks it” (Phonemics, p. 67). 

81 This term was invented by B. L. Whorf, probably about 1934, and was 
passed on by him to M. Swadesh and others. The first use of it in print was, 
as far as I know, in the article by Trager and Blcch entitled The Syllabic 
Phonemes of English in Language, XVII, No. 3 (1941), pp. 243—240. 

“ First suggested, I believe, by D. Abercrombie. See his article Phonetic 
Transcriptions in Le Maitre Phonitique, July, 1963. 

" The tone-marking in the Sechuana Reader by myself and Plaatje is 
allotonic. That in the Tswana (Sechuana) text on p. 49 of the Principles 
of the I.P.A., 1949, is tonemic. 
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considered to be the “long" of the open i (as in bid) there are two chronemes, 
long and short, applicable to certain vowels. Each chroneme comprises 
several easily distinguishable chrones; thus the three degrees of vowel 
length heard in bi:d (bead), bi:n (been), bi:t (beat) all belong to the long 
chroneme, while those in bid (bid) , bin (bin), bit (bit) belong to the short 
chroneme. 

32. The degrees of length belonging to a particular chroneme may be 
called "allochrones.” 64 And in transcriptions we can have "chronemic” 
(broad) marking of length, which has to be distinguished from "allochronic” 
(narrow) marking of length. Allochronic representations of length show 
differences of length which are not significant. 

33. Similar terminology is rarely apphcable to stress, though there are 
often "significant” degrees of stress. The reasons for this are set out in detail 
in my article Chronemes and Tonemes in Acta Linguistica (Copenhagen), IV, 
No. 1, dated 1944, 68 and in Chap. XXIV of the present book, especially 
§§ 468, 469. 

34. During the later 1920’s impetus was given to the study of the phoneme 
by the group of Eastern European scholars who, on the initiative of 
V. Mathesius in 1926, formed themselves into the Cercle Linguistique de 
Prague. Foremost among them, in addition to Mathesius, were N. Trubetz¬ 
koy (1890-1938), R. Jakobson and S. Karcewski. They were not pupils of 
Baudouin de Courtenay nor even pupils of his pupils, but they were of 
course familiar with his work. The remarkable achievements of these scholars 
and of other who joined them are well known, and their conclusions are to 
be found in the series of Travaux du Cercle Linguistique de Prague which 
commenced in 1929 and culminated in Trubetzkoy’s masterly Grundziige 
der Phonologic published in 1939 shortly after his premature death. 6 * 

36. The activities of the Prague School covered the whole field of linguistic 
theory, but the contributors were naturally obliged to devote much attention 
to phonemes, which when defined as explained above, cannot help constituting 
the chief means of distinguishing one word from another in every language, 
and are therefore fundamental linguistic units. Actually they did not follow 
Baudouin de Courtenay's description of the phoneme as a phonetic con¬ 
ception. They took the opposite course of treating phonemes as units of 
structure which were "realizable” as speech-sounds. 67 Fortunately this change 


64 I have had occasion to employ this term in my article The Hyphen as a 
Phonetic Sign in the Zeitschrift fur Phonetik, IX, No. 2, 1956. 

66 This article was actually written in 1946. 

64 It was seen through the press by Jakobson. The book is now available 
in a French translation by J. Cantineau: Principes de Phonologie (Klincksieck, 
Paris, 1949). 

67 Easily intelligible definitions of the phoneme on this basis are difficult 
to formulate. See, for instance, those by Mathesius in Trav. Cercle Ling. 
Prague, I (1929), p. 68; Jakobson in T.C.L.P., II (1929), p. 5; Trubetzkoy 
in his Grundziige, p. 34, and again on p. 36 (in Cantineau’S translation, 
bottom of p. 37 and top of p. 40). 
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of front did not affect materially the all-important practical applications of 
the phoneme theory. 

36. In one important respect the findings of the Prague School were in 
accord with the intentions of Baudouin de Courtenay and with the work 
that was being done in London. The term "phoneme” as understood in the 
Prague School had to do solely with vowels and consonants. 88 The sound 
or syllable "attributes” (length, stress, voice-pitch, etc.) 88 and the "boundary 
signs" (indications of syllable separation, beginnings and ends of words, etc.) 
were treated separately in distinct categories. As Karcewski put it in 
T.C.L.P., IV (1931), p. 194: "La notion de phonitme repose sur la division 
de la chaine phonique en unites qualitativement difKrentes, qui sont les opposi¬ 
tions de timbre For the separate treatment of sound and syllable attributes 
and boundary signs readers are referred to the long sections on these subjects 
in Trubetzkoy’s Grundziige (pp. 166 ff. and 241 ff.). 81 

37. In America little interest was shown in the phoneme theory during 
the 1920’s. I have a feeling, however, that Sapir (see § 2 of this Appendix) 
was on the track of it when his book Language appeared in 1921. He did 
not make use of the term "phoneme” in that work, nor was any sort of 
formulation of the idea expressed there. But hints of it are to be found in 
his chapter on sounds (Chap. III). In 1926 the principle appeared more 
clearly in his article Sound Patterns in Language There he spoke of "indi¬ 
vidual variations” of sound and "conditional variants," the latter being 
"dependent on the phonetic conditions in which the fundamental sound 
(‘point of the pattern’) occurs.” By "individual variations” he evidently 
meant what I should call “members of a diaphone,” and by "conditional 
variants” what are now called "members of a phoneme” or “allophones.” 
By “point of pattern” he presumably meant what we now call the "principal 
member” or "norm” of a phoneme. He regarded these features of speech 
from a psychological angle, and did not hesitate to use such expressions as 
"intuitively felt phonetic systems” and “the psychological equivalent of a 
single sound.” 88 

38. By 1933 it became clear from his article La R&alitt psychologique des 
Phonemes'* that Sapir had then (and doubtless had had for a long time 
previously) a full understanding of the phoneme idea, viewed psychologically. 
It is probable that he worked out the theory independently, without reference 


88 i.e. with what are now often called “linear” or "segmental” elements 
of speech. 

88 i.e. the "suprasegmental” or "prosodic” features. 

80 Karcewski’s italics. 

81 In Cantineau's translation, pp. 196 and 307. 

88 In Language, I, No. 1, pp. 37-61. 

88 Incidentally, he made use of the term "phoneme” on three occasions in 
Sound Patterns in Language with the meaning "speech-sound” in each case. 
He also employed a term "phonem" (a misprint?) on p. 46, line 11, apparently 
meaning a "sound sequence.” See also § 4 of the present Appendix. 

88 See footnote 4 above. 
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to, and possibly without knowledge of, the work previously carried out in 
this field by Baudouin de Courtenay and his followers .* 6 

39. After 1930 interest in the phoneme theory became widespread in 
America. It may be said to have originated with the appearance in 1933 of 
Leonard Bloomfield’s now classical book Language, •• i.e. comparatively 
recently when considered in relation to the whole history of the phoneme. 
We owe a great debt of gratitude to him and to the many American 
linguisticians who have followed him for their prolific work dealing in one 
way or another with the phoneme, and particularly for their continuous 
efforts to make known far and wide the difference between phonemes and 
speech-sounds—a distinction which is even now found difficult to grasp by 
many non-specialist students of languages. 

40. Confusion between the two is naturally fostered by the fact that 
written symbols have to be used sometimes for denoting speech-sounds and 
sometimes for the representation of phonemes. And what has helped perhaps 
more than all verbal descriptions to emphasize the distinction has been the 
use in American publications of slant lines // to enclose symbols denoting 
phonemes and sequences of phonemes—a usage which is, I am glad to say, 
now beginning to find its way to Europe and elsewhere. This simple and 
exceedingly effective device was the invention of Bernard Bloch, the editor 
of the journal Language. 

41. Our American colleagues will, I am sure, pardon me if I put forward 
as a recommendation that in future work they should define the phoneme 
in a manner in keeping with the intentions of the European pioneers. By 
some mischance they have for the most part extended the meaning in a way 
not previously contemplated, namely by applying the term to length and 
other prosodic features (when significant) and even to significant features of 
juncture.* 7 This innovation was, in my opinion, an unnecessary and unfor¬ 
tunate one. It appears to have originated from a terminology employed by 
Bloomfield, who, however, had taken the precaution of distinguishing signifi¬ 
cant prosodic features from the segmental elements previously known as 


•* Sapir is undoubtedly to be numbered among the pioneers in the field 
of phonemics. Since, however, in his published works he made so few and 
such scattered references to the phoneme idea, we are liable to underrate 
his services to the subject. 

•• Published by Henry Holt and Co., New York, and (in an edition specially 
adapted to the needs of British readers) by Allen and Unwin, London. 

• 7 Not all American linguisticians have done this. For instance, Einar 
Haugen has called significant suprasegmental elements of speech "prosodemes” 
(quite a good term). In the course of an article entitled Phoneme or Prosodeme 
in Language, XXV (1949), pp. 278-282 he expressed the opinion that “there 
would seem to be an advantage in distinguishing the prosodemes from the 
phonemes as thoroughly as possible.” The term "prosodeme” is also used 
by S. Potter in his Modern Linguistics ((Deutsch, London, 1957). (It had 
been employed before by Trubetzkoy, who, however, restricted its use to 
"prominence” of syllables. Grundzuge, p. 180.) 
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phonemes by calling the former "secondary phonemes.” Owing to the 
excellence of his book Language in other respects and to the immense influence 
it deservedly exerted, this terminology was soon adopted by others. These 
followers, however, omitted Bloomfield’s important qualifying word 
“secondary," and as a result the word "phoneme” came to have different 
senses in Europe and America. 

42. The situation is, however, remediable. What seems to me really to 
lie at the back of the present difference of usage has been the absence of an 
appropriate comprehensive term to denote "any speech feature whatever 
(segmental or otherwise) which can be used for distinguishing meanings." It 
is, I believe, the lack of a suitable term for this that has led many American 
writers to employ the unsuitable word “phoneme” to denote it. We need 
an unequivocal term, and I submit that such a term should be related to 
"significance” and not to "phone.” An appropriate term is "signeme."” 
The use of this word will enable us to distinguish conveniently the different 
types of significant difference by employing the terms "signemes of phone,” 
"signemes of length,” "signemes of stress," “signemes of pitch" and “signemes 
of juncture.” Then the term "phoneme” (which would be equivalent to 
"signeme of phone”) can be retained for the purpose for which it was originally 
invented in the school of Baudouin de Courtenay and adopted by those 
who have based their work on his. 


•* This term was first suggested by Dennis Ward of Edinburgh. 
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INDEX OF LANGUAGES REFERRED TO IN THIS BOOK 

References are to paragraphs 

Abua, vowel with breathy voice, 278 (3). 

American English, voiced t, 109; j, 278, 278 (2); vowel length, 521; mixture 
of dialect, 640; conscious acquisition of D and a-, 646; u and ii as separate 
phonemes, 732. 

Annamese, see Vietnamese. 

Arabic, varieties of a, 93, 118, 139, 185, 673; doubled consonants, 365; 
final long consonants, 369; representation of ?, 679. 

Argentine Spanish, s-sounds, 117. 

Bambara, r-like and l-like sounds, 630. 

Bantu, mb, fid, 17. 

Bengali, c, y t and J, 16; r\_, 84. 

Brazilian Portuguese, a corresponding to e in Portuguese of Portugal, 
743. 

Burmese, 4 and I, 116; prominence by tone, 444; distinctions by tone in 
combination with voice-quality, 585, 586. 

Cantonese, examples of minimal distinctions by tone, 61; duple distinction, 
65; triple distinction, 66 ; Z in same phoneme as i, 290; emphasis by 
length, 354, 768 footnote 40; example of toneme, 475; combination of 
length and tamber, 527. 

Chinese, n and g, 116; z in same phoneme as i, 290; tones derived from 
sounds, 766; changes of tone, 763, 764. 

Chwana, see Tswana. 

Croatian dialect, distinction by tamber combined with type of stress, 542. 

Danish, varieties of a, 92, 134, 181, 182, 602; d and 5, 124; n and §, 181, 
182. 

Dutch, g, 76, 116; a-sounds, 106; combinations of length and tamber, 523, 
524. 

Efik, kp, 19; i and 3, 49, 119. 

English, examples of minimal distinctions, 57, 58; duple distinctions, 65; 
triple distinctions, 66 ; t-sounds, 72, 108, 661; representation of t, 683; 
d and 3, 124; k-sounds, 21, 36, 68 , 134, 258, 270, 491, 496, 500; ? and t, 
33, 604, 648; ? belonging to three diaphones, 618; phonemic grouping of ?, 
620; tj, 259; t + J. 16; possible overlapping of t and r, 311; h and Q, 
32, 53; gg and g as variphone in Midland speech, 631; g and 3 , 53, 152'; 
g and n, 116; syllabic n, I, 301; l-sounds, 71, 670; overlapping of clear 
and dark I, 313; f, 37; s and z, 120; S and z, 169-175; Z-like 3 , 177; 
J-like s, 178; varieties of J, 602; h-sounds, 69; fi, 78; W, 12, 235; j, 12, 
236; W distinct from German V, 133; i: and i, 62, 104, 511, 515; i and 
3, 49; varieties of e, 88 , 134, 600; phonetic representation of e, 246, 
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882; unstressed i derived from e, 336 footnote 22; ei, 238, 246, 246; 
o: and o, 104, 612, 515; representation of o, 682; Western long oi, 251; 
varieties of OU, 601; monophthongal member of ou-phoneme, 252-256; 
u: and u, 63, 104, 512, 515; fronted u, 89, 520, 732; a-sounds, 106, 118, 
513, 515, 651; short a distinct from German c, 133; a:, A, a, 148, 
197-209, 517 ; phonemic status of a, 148, 337 ; a: distinct from French 
ce, 133; ai, au. 239, 241, 522; weakly stressed e, a, O, A, 535, 541; 
levelled forms of aia, aua, 248-250, 606; intermediate between a and 
3, 607; length of sounds before b, d, y, Z, etc., 188-191; South- 
Western r-coloured vowels, 278 (2), 726; significant length of close i, 
377, 380, 759; doubled short i, 360; doubled consonants, 365; variations 
in vowel length, 389-391, 393, 395, 396, 554; measurements of lengths 
of vowels, 398, 403-406; distinction by length in combination with 
length of adjoining sound, 532, 533; lengthening traditionally short 
vowels, 130, 519, 592 footnote 1, 734, 760; shortening traditionally 
long vowels, 520, 766, 769; significant length in Scottish, 377, 522, 
755, 759; prominence by tamber, 437; prominence by length 

(syllabification), 439; prominence by stress, 436, 458, 459; distinctions 
by stress, 463, 566-573; possible significance by secondary stress, 462 
footnote 25; diminuendo stress, 466; stress for emphasis, 342, 345; 
linking of stress with intonation, 457-459, 489; intonation of questions, 
347 ; intonation of astonishment, 351 ; vowel-quality varying with intona¬ 
tion, 114; examples of diaphones, 600-602; individual differences of pro¬ 
nunciation, 591 ff.; overlapping of e and a diaphones, 610; overlapping of 
OU and au diaphones, 611 ; overlapping of ei and ai diaphones, 612; i and e 
diaphones, 612 ; o and a diaphones, 612 ; o: belonging to two diaphones, 616 ; 
two U diaphones, 617 ; mixture of dialect, 638 ; intentional change of pronun¬ 
ciation, 642; symbols E, 3 necessary in transcribing Chaucerian English, 
678; ignoring variants in phonetic orthography, 692-694; orthography 
allowing for various accents, 695; spellings phonetic for some speakers 
but not for others, 696-698; alternative spellings, 701; departures from 
phonetic consistency in orthography, 702-705; phonemes remaining 
unchanged for long period, 708; e: > e: > il, 710, 742; possible develop¬ 
ment ai > ai, 713; u: > au and o, 714, 715; development of Middle 
English a: and ci relative to short E, 716; appearance of e: in sixteenth 
century, 717; ai < Middle English i:, 718; sixteenth century El > modem 
ei, 719; EJ > aj, 723, 750; separation of rj from n-phoneme, 731; oi 
imported from French, 736; separation of v-phoneme from f, 730, 739; 
4 > I, 743; disappearance of x, 744; eu and eu > ju:, 749; origin of 
modem a: and a:, 760; origin of syllabic 4 in South-Western and American, 
751 ; loss of a resulting in syllabification of a consonant, 754 ; shifting stress, 
762. 

Estonian, variety of E due to palatalized consonant, 184; final long consonants, 
372; three chronemes, 399, 419, 420; prominence by stress, 447; differences 
of tamber accompanying difference of length, 629. 

Ewe, nasalization of a, 320. 

Finnish, long vowels, 378. 
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French, a-sounds, 26, 131, 621, 622; e and E, 145, 214-222, 307, 309; e moyen, 
110, 214-221; 3 -sounds, 90; o, 37; o and 3, 151; 3 , 0 , os, 297-300; over¬ 
lapping of 3 and oe, 305, 306, 309; substitution of ce for 3, 710; y, 231, 
282, 670; U and W in one phoneme, 283, 284; nasalized vowels, 278 (1); 
1 and I, 32, 73, 116; s and z, 120; d-sounds, 74; $, 176; j, 14, 243, 285, 
500; doubled vowels, 360; doubled consonants, 365; distinctions by 
length, 59, 379, 410, 549; relation between length and stress, 386, 394; 
chronemes, 410-414; lengthening of a in bagtte, etc., 562, 758; introduction 
of significant length, 769; incidence of stress, 431; stress for emphasis, 
343, 346, 768 footnote 40; intonation of questions, 347; individual 
differences of pronunciation 592; mixture of dialect, 639; intentional 
changes of pronunciation, 644, 645; ignoring assimilations and elisions 
in phonetic orthography, 693; compromise spelling, 669; alternative 
spellings, 701, 703; unpronounced letters in phonetic orthography, 704; 
phoneme remaining unchanged for long period, 708; y < Latin u:, 709; 
h in Norman French, 736; j < I and W < O, 740; I > U, 727; X > j, 743; 
d and 8 in Old French, 746; js < e, 747; -ije > -ie, 747 footnote 24; 
wa < ei, 748; length compensating for loss of s, 752; length compensa¬ 
ting for loss of 3, 763. 

Ga, language containing nasalized vowels, 274. 

Georgian, p’, t’, 17. 

German, qualities of long and short vowels, 105, 525; i: and i, 63; e and £, 
132; ei, E, 3, 91, 149, 223-229; varieties of a:, 603; R, 91 footnote 12; 
j in same phoneme as i, 288; p, [p, b constituting a variphone, 632; p and 
b in phonetic orthography, 700; ts, 16; n and g, 116; s and Z, 120; 
J and 5 . 143; h and 5 , 150; 5 and x, 230-232, 671; doubled consonants, 
365; final long consonants, 368; minimal distinction by length, 59; 
reduced vowel length in pretonic position, 408, 421; chronemes, 407-409; 
distinctions by length in combination with pitch, 561-665; significant 
stress, 454; stress for emphasis, 342; intonation of questions, 347; 
departures from phonetic consistency in orthography, 702. 

Greek, b, d, g, 123; rrj, 212; significant stress, 454; tones replaced by stress, 
765. 

Hindi, C, J, 16; doubled C, J, 271; other doubled consonants, 366, 366; 
n and Q, 116; q, f, Z,X, y imported, 736; q,f, z,x, y in romanic orthography, 
700; nasalized vowels, 278 (1), 279, 318, 319; vowels with breathy voice, 
278 (3), 279; 3th, 131, 162; aoh, 163; varieties of 31 , 603; intonation 
of questions, 347, 361; departure from phonetic consistency in 
orthography, 703. 

Hottentot, representation of clicks with simultaneous g, 17. 

Hungarian, g, 75, 116; a-sounds, 106; final long consonants, 372; combina¬ 
tions of length and tamber, 523, 524. 

Igbo, J and 5, 80; |-like r, 82; u and o, 186; converting statement into 
question, 353; vowel harmony, 509. 

Irish dialects of English, use of t for 0, 639. 
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Italian, g, 75, 116, 211; rg, 212; s and Z in Roman, 120, 121, 701; doubled 
affricates, 271; j in same phoneme as i, 286, 289; w in same phoneme 
as u, 286; final o, 210, 540; obscured final a, 540; variations in consonant 
length, 392; distinctions by comparative length of two adjacent sounds, 
550, 651; association of length with stress, 555; assimilation and initial 
doubling ignored in phonetic orthography, 692; alternative spellings in 
phonetic orthography, 701. 

Japanese, long and short vowels, 59, 378, 488; h, $, <j>, 77, 116; English h, 
f difficult for Japanese, 143; s and Z, 120; s and J, 330, 331; d, Z, dj, 
dz, 323-329; r, 629; <t> before vowels other than u, 738; syllabic nasal, 
122, 292-295; phonetic representation of u, 683; romanic spelling, 140. 

Kabardian, difficulty in learning use of vowel variants, 674. 

Kikuyu, minimal distinction by length, 59; triple distinction, 66; 4>, p. b. 
79, 141; 0, 3, d, 116, 141; y and g, 141; interrogative tone patterns, 
363; significant length of vowels, 378; ignoring elision and coalescence 
in orthography, 692; b being introduced, 737. 

Korean, bilabial consonants, 194. 

Latin, ui > y in French, 709, 720 footnote 8; b > P in Spanish, 711; short 
i, U > e, O in Vulgar Latin, 720; possible attachment of £ to e-phoneme, 
721; separation of e from e-phoneme in Vulgar Latin, 721; a > E in 
French, 722; a > 3 in French, 725; t > d > 8, 724; j > 5 and z, 740; 
W > V, 740; e > ie, jc, 747; e: > wa in French, 748. 

Luganda, example of minimal distinction by length, 59; I and r sounds, 83; 
initial long consonants, 367; significant length of vowels, 378. 

Malinke, r-like and l-like sounds, 630. 

Marathi, n and q,, 128. 

Norwegian, tones, 576-579. 

Nuer, vowel with breathy voice, 278 (3). 

Panjabi, n and q, 128; tones derived from sounds, 756. 

Pedi, J and 1, 127, 680. 

Pekingese, minimal distinction by change in three attributes, 64; vowel- 
quality varying with tone, 112, 113, 242, 543, 545; U, ill, iau, iou, 257; 
ui, uei, 257 footnote 15; Ag, an, ag, an, 603; members of e-phoneme, 
674; 6 , X, J, S, 196, 196; distinctions by tone combined with length, 
667; giving prominence to words by making other words toneless, 768 
footnote 40. 

Portuguese, b replaced by a in Brazilian, 743. 

Provencal, r and R, 145; nasal consonants, 157-160; significant stress, 464. 

Quechua, q, 133. 

Rhineland German, distinctions by length in combination with pitch, 
561-565. 

Russian, soft and hard consonants, 95, 179, 183, 605; S and Z, 120; varieties 
of i and j, 97; l, i, e, 332-336; varieties of e, 98, 183; varieties of a, 96, 
118, 183; overlapping of e and a, 308, 309; varieties of o and U, 99; 
weak vowels, 111, 332, 333, 536-539; vowel length, 417; u and u as 
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separate phonemes, 733; doubled consonants, 365; initial long consonants, 
367; stress for emphasis, 342; intonation of questions, 347; distinctions 
by tamber combined with stress, 536-539. 

Scottish English, Ai, ae, 239, 522; long vowels, 377; shortening traditionally 
long vowels, 62, 520, 755, 759; significant length, 377, 522, 759. 

Serbo-Croat, three significant degrees of stress, 461; different kinds of stress, 
466, 467, 583 ; tonemes, 486; difference of tamber accompanying difference 
of tone, 547; difference of length accompanying difference of tone, 558-560. 

Shan, tones, 484; combination of length and tamber, 527. 

Siamese, see Thai. 

Sinhalese, mb, fid, fic|_, fig, 17, 260-269; q_, 84, 260; qualities of long and short 
vowels. 101-103, 489, 491; chronemes, 415; prominence by length, 438. 

Somali, doubled consonant, 366; long vowels, 378; distinctions by kinds of 
stress combined with tone, 580-582. 

Sotho, representation of d, 680. 

Spanish, distinctions by stress, 60, 454; vowel areas, 304; obscured final a, 
640; vowel length, 417; g, 75, 116, 211, 502; s and Z, 120; d and 8, 124, 
143, 633; sounds of tt, 603; b and p, g and y forming variphones, 633; 
P < Latin w (or v), 743. 

Spanish of Buenos Aires, s-sounds, 117. 

Sudanic languages, p and f, b and V, 125; breathy voice, 274. 

Swahili, n and g, 116; disappearance of tones, 766. 

Swedish, long consonants, 371, 374; distinction by comparative length of 
two adjacent sounds, 552; distinctions by tone linked with stress, 575. 

Syriac, t, 138. 

Tamil, grouping several distinct sounds into single phonemes, 85; k, g, X, 5 , 
y, 86; orthography, 87; c, (3, %, 272; long consonants, 373; prominence 
by length, 438; values of consonants varying with length, 530, 631. 

Telugu, r\_, 84; vowel harmony, 508. 

Thai (Siamese), combination of length and tamber, 527. 

Tswana (Chwana, Rolong dialect of Mafeking), p, ph, 17; minimal distinc¬ 
tions by tone, 61; J and I, 81, 127; n and g, 116; J and 164-166; 
converting statement into question, 352; double vowels, 360; initial long 
consonants, 367 ; semantic length in sentence, 383-386, 422 ; stress, 386, 
431; prominence by tone, 449; grouping tones into tonemes, 476-483. 

Turkish, c, t + J, 16. 

Urdu, g_, 84; s and z, 120. 

Vietnamese, vowels with creaky voice, 278 (4); combination of length and 
tamber, 528; distinctions by tone in combination with voice-quality 
587. 

Vulgar Latin, u > French y, 709; e, O, 720; E, 721. 

Welsh, 4 and I, 116; intonation, 351. 

Xhosa, gj, g!>, nj, etc., 17. 

Zulu, gj, gS, etc., 17; 4 and I, 116; s and z, 120. 
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References are to paragraphs 


Abstract sounds, 20, 38. 

Acoustic distinctions between words, 
condition for, 493. 

Acoustic impressions. Chap. IX, 
illusions, 136. 

Adjoining sounds, furnishing identi¬ 
fication of sounds next to them, 
Chap. XIII. 

Affricate, 16. 

All-India Romanic Orthography, 162, 
163. 

Allophone, 24. 

Allochrones, 399. 

Allotone, 473. 

Ascertaining phonemes, Chap. X. 

Aspiration, 18. 

Assigning sounds to phonemes, 193 ff. 

Aural illusions, 136. 

Breathy voice, 273, 274, 585-587. 

Broad transcription, 43, 652, 669 ff. 

Chrone Languages, 59, 381. 

Chronemes, Chap. XXIII; representa¬ 
tion in writing, 423; overlapping, 
416, 421. 

Chrones, definition, 398; grouping 
into chronemes, 399, 402-417; 

differences according to type of 
sentence, 422. 

Coloured vowels, Chap. XVI; nature, 
273; representation, 275-277. 

Colourings as phonemes, 280, foot¬ 
note 12. 

Complexes of attributes. Chap. XXVI; 
list of combinations, 498; two 
neighbouring tambers, 500 ff.; tam- 
ber with duration, 510 ff.; tamber 
with length of an adjoining sound, 
532-534; tamber with a degree of 
stress, 535—541; tamber with type 
of stress, 542; tamber with voice- 
pitch, 543-547; duration with 
duration of an adjoining sound, 
548-553; duration with degree of 


stress, 554, 555; duration with 
type of stress, 556; duration with 
voice-pitch, 557—565; degree of 
stress with degree of stress on 
adjacent syllable, 566-569; degree 
of stress with voice-pitch, 570 ff.; 
kind of stress with voice-pitch, 
580-583; voice-pitch with voice- 
pitch on neighbouring syllable, 
584; voice-pitch with voice-quality, 
585-587. 

Complexity of controlling principles, 
214 ff. 

Compound sounds, 9, 12-19, Chap. 
XV, 258 ff., 270 ff. 

Concrete sounds, 20, 38. 

Conditioned variants, 23, footnote 15. 

Contrast, 344, 345. 

Controlling principles, 30, 214 ff. 

Creaky voice, 273, 274, 585-587. 

Crescendo-diminuendo stress, 463-466. 

Crescendo stress, 463-467, 580. 

Determining sound, 728. 

Diaphones, Chap. XXVII; nature of, 
598; examples of, 599 ff.; over¬ 
lapping, 610-615; sounds belonging 
to two diaphones, 615-618; dia¬ 
phones and phonemes, 619-626. 

Difference of usage in different 
languages. Chap. VIII. 

Difficulties in establishing phonemic 
groupings. Chap. XIV. 

Difficulties in identifying sounds, 
Chap. XII. 

Digraphic representation of Russian 
consonants, 95. 

Diminuendo stress, 463-467, 580. 

Di-phonemic sounds, Chap. XX. 

Diphthongs, 12, 14, 15, 237, (as) 248, 
(aa) 249. 

Distinctive function, Chap. IV, 168 ff. 

Divergent, 24. 

| Double articulation, 19. 
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Doubled consonants, 364-374. 

Doubled vowels, 362, 363. 

Doubling, 360-374; subjective per¬ 
ception of, 362. 

Duple distinctions, 65. 

Duration, functions of, 339-343; 
significant duration. Chap. XXII, 
372, 376-380; sounds of limited 
duration, 339, 357; duration and 
doubling, 360-374; distinguishing 
one kind of sentence from another, 
383, 384; relationship with stress 
in non-stress languages, 386; 
modification for emphasis, 397; 
measurements, 403-406; of English 
vowels, 402-406; of German 
vowels, 407-409; of French vowels, 
410-414; of Sinhalese vowels, 415, 
416; of Russian vowels, 417; of 
Spanish vowels, 417; of Estonian 
sounds, 419, 420; differences 

according to type of sentence, 422; 
representation in writing, 423. 

Elusive sounds, 202. 

Erratic pronunciation, Chap. XXVIII, 
175, 253, 609. 

Emotional states, indications of, 341, 
350. 

Emphasis, for intensity, 342-344, 
432; for contrast, 344, 345, 432; 
for reflecting emotional states, 341, 
350; or suggesting other implica¬ 
tions, 341, 347-355. 

Free variations, 598. 

Free variants, 601. 

Fully long vowels, compared with 
moderately long vowels, 561. 

Functional view of the phoneme, 654 ff. 

Glides, 5. 

Gliding sounds, 9,12,14,15, Chap. XV. 

Glottal stop in English, 33, 604, 618, 
620, 639, 648, 712; in Arabic, 679. 

Hepburnian writing of fapanese, 140. 

History of Language, 678, Chap. 
XXXII; phonemes unchanged, 
708; entire phoneme changed, 709, 
710; change in principal member, 
711, 714-719; change in sub¬ 

sidiary member, 712, 713, 720-725; 


appearance of a new phoneme, 
726, 735-740, 755; separation of 
a subsidiary member from principal 
member, 711, 720, 725, 728-734; 
disappearance of a phoneme, 741- 
745; disappearance of a subsidiary 
member, 746; replacement of a 
sound by a sequence of two 
sounds, 747-749; replacement of 
a sound by a length, 750-754; 
replacement of a sound by a tone, 
756; non-significant change of 
length, 758; appearance of signi¬ 
ficant length, 759; disappearance 
of significant length, 760; change 
in a type of stress, 761; shifting 
position of strong stress, 762; 
changes of values of tones, 763; 
appearance of new tones and dis¬ 
appearance of old ones, 764; sub¬ 
stitution of stress for tone, 765; 
disappearance of tones, 766; changes 
in sentence intonation, 768. 

Ideal sounds, 653. 

Identification of sounds by charac¬ 
teristics of adjoining sounds, Chap. 
XIII. 

Idiolect, 28. 

Imperfect diphthong, 14. 

Implications, methods of suggesting, 
341-355; in tone languages, 352- 
355. 

Incidental sounds, 318-320. 

Individual differences of speech, 588 ff. 

Indivisible length, as compared with 
doubling, 360. 

Intensity, 342-344, 432. 

Intentional changes of pronunciation, 
641-648. 

Intonation, functions of, 339-351, 
355; grouping tones into tonemes. 
Chap. XXV. 

Kunreisiki RSmazi, 326. 

Language, meaning of the term, 28, 
29, 658. 

Length, functions of, 339-343; signi¬ 
ficant length, Chap. XXII, 372, 
376-380; sounds of limited length, 
339, 357; length and doubling. 
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360-374; distinguishing one kind ! 
of sentence from another, 383, 384; 
relationship with stress in non¬ 
stress languages, 386; modification 
for emphasis, 397; measurements, 
403-406; of English vowels, 402- 
406; of German vowels, 407—409; 
of French vowels, 410-414; of 
Sinhalese vowels, 415, 416; of 
Russian vowels, 417; of Spanish 
vowels, 417; of Estonian sounds, 
419, 420; differences according to ' 
type of sentence, 422; representa¬ 
tion in writing, 423. 

Lengthening traditionally short vowels 
in English, 130, 519, 592, 734, 760. 

Level stress, 463, 466, 467. 

Linguistic sense, 211, 212. 

Linked sequences, 501 ff. 

Long consonants, initial, 367; medial, 
365, 366, 373, 374; final, 368-372. 

Long sounds, variations in length of, 
387 ff. 

Manner of use as a criterion, Chap. 
XVIII. 

Medium long vowels, compared with 
fully long vowels, 561. 

Member of a phoneme, 23. 

Mentalistic conception of the phoneme, 
Chap. XXIX, 87, 649 ft .; possible 
bearing of history on this concep¬ 
tion, 769. 

Minimal distinction, Chap. VI, 488- 
498. 

Mixture of dialect, 638-640. 

Monographic representation of Russian 
consonants, 95. 

Monophone, 37. 

Narrow transcription, 44, 652, 670 ff. 

Nasalized vowels, 273, 274. 

Need for accurate analysis, Chap. XI. 

Non-phonetic criterion, 219-221. 

Norm (of phoneme), 24; (of toneme), 
473. 

Oppositions, 53. 

Orthography, 691 ff.; ignoring variant 
pronunciations, 693; covering 
various accents, 695-700; showing 
relationships between words, 704; 


variant spellings unavoidable, 701; 
distinguishing homophones, 703- 
705; prejudice created by, 212. 

Oscillograms, Frontispiece, 2, 6. 

Overlapping of chronemes, 416, 421. 

Overlapping of diaphones, 610-615. 

Overlapping of phonemes, Chap. XIX; 
in different contexts, 303-313; in 
the same context, 314, Chap. XX. 

Overlapping of tonemes, 472. 

Penetration, 427. 

Phoneme, Chap. II; explanation, 25, 
26, 31; possible bearing of history 
on the determination of the nature 
of the phoneme, 769; bearing of 
theory of diaphones on theory of 
phonemes, 624; existence of, 659; 
practical use of the theory, Chap. 
XXX; basic elements of language, 
663-665; use in construction of 
systems of writing, 666. 

Phoneme theory, object of, 208. 

Phonemic analysis, based on words, 
34, 688, 689. 

Phonemic classifications, need for 
establishing them from the speech 
of a single individual, 28, 218 ff. 

Phonemic grouping, Chap. VII, and 
elsewhere. 

Phonetic context, 30. 

Phonetic transcription, Chap. Ill, 
669 ff.; principles of, 684. 

Phonetic writing, Chap. XXXI. 

Phonetics, an art as well as a science, 
661; use in learning foreign 
languages, 664. 

Physical conception of the phoneme, 
649. 

Physiophonetics, 652. 

Physiophonic transcription, 652. 

Plosion, 13, 16, 18. 

Principal member of a phoneme, 24. 

Principal member of a toneme, 473. 

Prolongability of vowels, 561, foot¬ 
note 46. 

Prominence, 434 ff.; by stress, 436; 
by tamber, 437; by length, 438, 
439; by intonation, 440-444; by 
combinations of attributes, 446, 
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447; to be distinguished from 
stress, 448, 449. 

Prosodic elements of speech, 356. 

Psychological view of the phoneme, 652. 

Psychophonetics, 652. 

Psychophonic transcription, 652. 

Pure vowels in diphthong phonemes, 
251 ff., 257. 

T-coloured vowels, 273, 274. 

Recognition of sounds through length 
of an adjacent sound, 187 ff. 

Rhythm, 395. 

Romanic Orthography for Languages 
of India, 162, 163. 

Secondary stress, 462. 

Semantic function of phonemes, Chap. 
IV. 

Short sounds, variations in length of, 
387 ff. 

Shortening in conjunction with weak 
stress, 394. 

Shortening traditionally long vowels in 
Scottish English, 62, 520, 759. 

Significant distinctions, 51, Chap. V. 

Simple sounds, 9-11, 243 ff. 

Single member phoneme , 37. 

Sound tracks, 2, 3, 6. 

Sounds assignable to either of two 
phonemes, 321 ff. 

Speech, definition, 1. 

Speech chain, 2-4. 

Speech-sounds, Chap. I. 

Stops, 13, 16, 18, 270. 271. 

Stress, Chap. XXIV; functions of, 
339-343; tendency to rhythmical 
arrangement, 395; nature of, 205, 
424 ff.; difficulty of determining, 
450, 451; to be distinguished from 
prominence, 448, 449; significant in 
words, 452 ff.; of limited effective¬ 
ness unless combined with other 
attributes, 465, 456, 670, 571; 
combined with intonation, 457- 
460; language with three significant 
degrees, 461; kinds of (level, 
crescendo, etc.), 463-467; deter¬ 
mination by laboratory methods, 
205, footnote 14. 

Stress languages, 60, 428 ff., 469. 


Stronemes, 468, 469. 

Style of speech, 28, 605-607. 

Sub-phonemic variant, 24. 

Subsidiary member of a phoneme, 24. 

Sulcalized vowels, 273, 274. 

Syllabic consonants, 439. 

Syllabic nasal (Japanese), 292-295. 

Syllabic use, 292-295, 301. 

Symbol economy, 518, 524. 

Tamber, origin of term, 41. 

Tamber and length of English vowels, 
relationship between, 510 ff. 

Tone languages, 61, 470. 

Tonemes, Chap. XXV; nature of, 
471 ff.; definition, 472; principal 
member, 473; grouping tones into, 
474 ff. 

Tones, of Pekingese, 64, 112, 113, 242, 
543, 545, 557, 768, footnote 40; of 
Cantonese, 61, 475; of Tswana, 
476 ff.; of Shan, 484; of Somali, 
485, 580-582; of Serbo-Croat, 486, 
547, 558-560; in German, 561-565. 

Triple distinctions, 66 . 

True length, as compared with 
doubling, 360. 

Unstable sounds, 202, 253. 

Variants conditioned by duration, 
101 ff. 

Variants conditioned by intonation, 
112-114. 

Variants conditioned by neighbouring 
sounds, 69 ff. 

Variants conditioned by stress, 108 ff., 
241. 

Varichrone, 416. 

Variphones, 628 ff.; place in the 
phoneme theory, 636, 637. 

Voiceless d, §, y, z, etc., 169-175. 

Voice-pitch, functions of, 339 ff. 

Voice-quality, 273, 274, 585-587. 

Vowel harmony, 507-509. 

Vowels and consonants as members of 
the same phoneme, Chap. XVII. 

Vowels with breathy voice, 161, 273, 
274, 585-587. 

Vowels with creaky voice, 273, 274, 
585-587. 

Word, 7. 
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